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VASCULAR ENDOTHELIAL GROWTH FACTOR-X 

The present invention is concerned with a novel 
vascular endothelial growth factor (VEGF) herein 
5 designated "VEGF-X", and characterisation of the 
nucleic acid and amino acid sequences of VEGF-X. 

Introduction 

10 Angiogenesis involves formation and proliferation of 
new blood vessels, and is an essential physiological 
process for normal growth and development of tissues 
in, for example, embryonic development, tissue 
regeneration and organ and tissue repair. 

15 Angiogenesis also features in the growth of human 

cancers which require continuous stimulation of blood 
vessel growth. Abnormal angiogenesis is associated 
with other diseases such as rheumatoid arthritis 
psoriasis and diabetic retinopathy, 

20 

Capillary vessels consist of endothelial cells which 
carry the genetic information necessary to proliferate 
to form capillary networks- Angiogenic molecules 
which can initiate this process have previously been 

25 characterised. A highly selective mitogen for 

vascular enothelial cells is vascular endothelial 
growth factor (VEGF) (Ferrara et a J., "Vascular 
Endothelial Growth Factor: Basic Biology and Clinical 
Implications''. Regulation of angiogenesis, by I.D. 

30 Goldberg and E.M. Rosen 1997 Birkhauser Verlag 

Basle/Switzerland) . VEGF is a potent vasoactive 
protein which is comprised of a glycosylated cationic 
4 6-4 9 kd dimer having two 24 kd subunits. It is 
inactivated by sulfhydryl reducing agents and is 

35 resistant to acidic pH and to heating and binds to 
immobilised heparin. 



VEGF-A has four different forms of 121, .165/ 189 and 
206 amino acids respectively due to alternative 
splicing, VEGF121 and VEGF165 are soluble and are 
capable of promoting angiogenesis, whereas VEGF189 and 
VEGF206 are bound to heparin containing proteoglycans 
in the cell surface. The temporal and spatial 
expression of VEGF has been correlated with 
physiological proliferation of the blood vessels 
(Gajdusek, CM., and Carbon, S.J., Ceil Physiol., 
139:570-579, (1989)); McNeil, P.L., Muthukrishnan, L., 
Warder, E., D'Amore, P. A., J. Cell. Biol., 109:811- 
822, (1989)). Its high affinity binding sites are 
localized only on endothelial cells in tissue sections 
(Jakeman, L.B., et al., Clin. Invest. 89:244-253 
(1989)), The growth factor can be isolated from 
pituitary cells and several tumor cell lines, and has 
been implicated in some human gliomas (Plate, K.H. 
Nature 359:845-848, (1992)). The inhibition of VEGF 
function by anti-VEGF monoclonal antibodies was shown 
to inhibit tumor growth in immune-deficient mice (Kim, 
K.J., Nature 362:841-844, (1993)), 

VEGF proteins have been described in the following 
patents and applications all of which are hereby 
incorporated by reference EP-0,506, 477, WO-95/24473, 
WO-98/28621, WO-90/13649, EP-0, 476, 983, EP-0, 550, 296, 
WO-90/13649, WO-96/26736, WO-96/27007, WO-98/49300, 
WO-98/36075, WO-98/840124, WO-90/11084, WO-98/24811, 
WO-98/10071, WO-98/07832, WO-98/02543, WO-97/05250, 
WO-91/02058, WO-96/39421, WO-96/39515, WO-98/16551. 

The present inventors have now identified a further 
vascular endothelial growth factor, designated herein 
as "VEGF-X", and the nucleic acid sequence encoding 
it, which has potentially significant benefits for the 
treatment of tumours and other conditions mediated by 
inappropriate angiogenic activity. 
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Summary of the Invention 

In the present application, there is provided a novel 
vascular endothelial growth factor, herein designated 
5 "VEGF-X", nucleic acid molecules encoding said growth 
factor, an expression vector comprising said nucleic 
acid molecule, a host cell transformed with said 
vector and compounds which inhibit or enhance 
angiogenesis. Also provided is the sequence of a CUB 
10 domain present in the sequence of VEGF-X which domain 
itself prevents angiogenesis and which is used to 
treat diseases associated with inappropriate 
vascularisation or angiogenesis. 

15 Detailed Description of the Invention 

Therefore, according to a first aspect of the present 
invention there is provided a nucleic acid molecule 
encoding a VEGF-X protein or a functional equivalent, 

20 fragment, derivative or bioprecursor thereof, said 
protein comprising the amino acid sequence from 
position 23 to 345 of the amino acid sequence 
illustrated in Figure 10. Alternatively, the nucleic 
acid molecule of the invention encodes the complete 

25 sequence identified in Figure 10 and which 

advantageously includes a signal peptide to express 
said protein extracellularly. Preferably, the nucleic 
acid molecule is a DNA and even more preferably a cDNA 
molecule. Preferably, the nucleic acid molecule 

30 comprises the nucleotide sequence from position 257 to 
1291 of the nucleotide sequence illustrated in Figure 
9. In a preferred embodiment the nucleic acid is of 
mammalian origin and even more preferably of human 
origin. 

35 

In accordance with the present invention a functional 
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equivalent should be taken to mean a protein, or a 
sequence of amino acids that have similar function to 
the VEGF-X protein of the invention. 

5 Also provided by this aspect of the present invention 
is a nucleic acid molecule such as an antisense 
molecule capable of hybridising to the nucleic acid 
molecules according to the invention under high 
stringency conditions, which conditions would be well 
10 known to those skilled in the art. 

Stringency of hybridisation as used herein refers to 
conditions under which polynucleic acids are .stable. 
The stability of hybrids is reflected in the melting 
15 temperature (Tm) of the hybrids. Tm can be 
approximated by' the formula: 

81 . 5°C+16. 6 (log 10 [Na + ] +0.41 (%G&C) -600/1 

20 wherein 1 is the length of the hybrids in nucleotides, 
Tm decreases approximately by 1-1. S°C with every 1% 
decrease in sequence homology. 

The term "stringency" refers to the hybridisation 
25 conditions wherein a single-stranded nucleic acid 

joins with a complementary strand when the purine or 
pyrimidine bases therein pair with their corresponding 
base by hydrogen bonding* High stringency conditions 
favour homologous base pairing whereas low stringency 
30 conditions favour non-homologous base pairing. 

"Low stringency" conditions comprise, for example, a 
temperature of about 37°C or less, a formamide 
concentration of less than about 50%, and a moderate 
35 to low salt (SSC) concentration; or, alternatively, a 
temperature of about 50°C or less, and a moderate to 
high salt (SSPE) concentration, for example 1M NaCl. 
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xx High stringency" conditions comprise, for example, a 
temperature of about 42 °C or less, a formamide 
concentration of less than about 20%, and a low salt 
(SSC) concentration; or, alternatively, a temperature 
5 of about 65°C, or less, and a low salt (SSPE) 
concentration. For example, high stringency 
conditions comprise hybridization in 0.5 M NaHPO,, 7% 
sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C 
(Ausubel, F.M. et a2. Current Protocols f,n Molecular 
10 Biology , Vol. I, 1989; Green Inc. New York, at 
2.10.3). 

W SSC" comprises a hybridization and wash solution. A 
stock 20X SSC solution contains 3M sodium chloride, 
15 0.3M sodium citrate, pH 7.0. 

"SSPE" comprises a hybridization and wash solution. A 
IX SSPE solution contains 180 mM NaCI, 9mM Na 2 HP0« and 
1 mM EDTA, pH 7.4. 

20 

The nucleic acid capable of hybridising to nucleic 
acid molecules according to the invention will 
generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95% homologous to the 
25 nucleotide sequences according to the invention. 

The antisense molecule capable of hybridising to the 
nucleic acid according to the invention may be used as 
a probe or as a medicament or way be included in a 
30 pharmaceutical composition with a pharmaceutical^ 
acceptable carrier, diluent or excipient therefor. 

The term '"homologous" describes the relationship 
between different nucleic acid molecules or amino acid 
35 sequences wherein said sequences or molecules are 

related by partial identity or similarity at one or 
more blocks or regions within said molecules or 
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sequences. 

The present invention also comprises within its scope 
proteins or polypeptides encoded by the nucleic acid 
5 molecules according to the invention or a functional 
equivalent, derivative or bioprecursor thereof. 

Therefore, according to a further aspect of the 
present invention, there is provided a VEGF-X protein, 

10 or a functional equivalent, derivative or bioprecursor 
thereof, comprising an amino acid sequence from 
position 23 to 345 of the sequence as illustrated in 
Figure 10, or alternatively which amino acid sequence 
comprises the complete sequence of Figure 10* A 

15 further aspect of the invention comprises a VEGF-X 
protein, or a functional equivalent, derivative or 
bioprecusor thereof, encoded by a nucleic acid 
molecule according to the invention. Preferably, the 
VEGF-X protein encoded by said nucleic acid molecule 

20 comprises the sequence from position 23 to 345 of the 
amino acid sequence as illustrated in Figure 10, or 
which sequence alternatively comprises the sequence of 
amino acids of Figure 10. 

25 The DNA molecules according to the invention may, 

advantageously, be included in a suitable expression 
vector to express VEGF-X encoded therefrom in a 
suitable host. Incorporation of cloned DNA into a 
suitable expression vector for subsequent 

30 transformation of said cell and subsequent selection 
of the transformed cells is well known to those 
skilled in the art as provided in Sambrook et al. 
(1989), molecular cloning, a laboratory manual, Cold 
Spring Harbour Laboratory Press. 

35 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 
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the invention operably linked to regulatory sequences, 
such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 
"operably linked" refers to a juxta position wherein 
5 the components described are in a relationship 

permitting them to function in their intended manner. 
Such vectors may be transformed into a suitable host 
cell to provide for expression of a polypeptide 
according to the invention. Thus, in a further 

10 aspect, the invention provides a. process for preparing 
polypeptides according to the invention which 
comprises cultivating a host cell, transformed or 
transfected with an expression vector as described 
above under conditions to provide for expression by 

15 the vector of a coding sequence encoding the 
polypeptides, and recovering the expressed 
polypeptides. 

The vectors may be, for example, plasmid, virus or 
20 phage vectors provided with an origin of replication, 
and optionally a promoter for the expression of said 
nucleotide and optionally a regulator, of the promoter. 

The vectors may contain one or more selectable 
25 markers, such as, for example, ampicillin resistance . 

Regulatory elements required for expression include 
promoter sequences to bind RNA polymerase and 
transcription initiation sequences for ribosome 

30 binding. For example, a bacterial expression vector 
may include a promoter such as the lac promoter and 
for translation initiation the Shine-Dalgarno sequence 
and the start codon AUG. Similarly, a eukaryotic 
expression vector may include a heterologous or 

35 homologous promoter for RNA polymerase II, a 

downstream polyadenylation signal, the start codon 
AUG, and a termination codon for detachment of the 
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ribosome. Such vectors may be obtained commercially 
or assembled from the sequences described by methods 
well known in the art. 

5 Nucleic acid molecules according to the invention may 
be inserted into the vectors described in an antisense 
orientation in order to provide for the production of 
antisense RNA. Antisense RNA or other antisense 
nucleic acids may be produced by synthetic means. 

10 

In accordance with the present invention, a defined 
nucleic acid includes not only the identical nucleic 
acid but also any minor base variations including in 
particular, substitutions in cases which result in a 

15 synonymous codon (a different codon specifying the 

same amino acid residue) due to the degenerate code in 
conservative amino acid substitutions. The term 
"nucleic acid sequence" also includes the 
complementary sequence to any single stranded sequence 

20 given regarding base variations. 

The present invention also advantageously provides 
nucleic acid sequences of at least approximately 10 
contiguous nucleotides of a nucleic acid according to 

25 the invention and preferably from 10 to 50 nucleotides 
even more preferably, the nucleic acid sequence 
comprise the sequences illustrated in Figure 3. These 
sequences may, advantageously be used as probes or 
primers to initiate replication, or the like. Such 

30 nucleic acid sequences may be produced according to 
techniques well known in the art, such as by 
recombinant or synthetic means. They may also be used 
in diagnostic kits or the like for detecting the 
presence of a nucleic acid according to the invention. 

35 These tests generally comprise contacting the probe 
with the sample under hybridising conditions and 
detecting for the presence of any duplex or triplex 



/ 
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formation between the probe and any nucleic acid in 
the sample. 

The nucleic acid sequences according to this aspect of 
5 the present invention comprise the sequences of 
nucleotides illustrated in Figures 3 and 5. 

According to the present invention these probes may be 
anchored to a solid support. Preferably, they are 

10 present on an array so that multiple probes can 
simultaneously hybridize to a single biological 
sample. The probes can be spotted onto the array or 
synthesised in situ on the; array. (See Lockhart et 
al., Nature Biotechnology, vol. 14, December 1996 

15 "Expression monitoring by hybridisation to high 

density oligonucleotide arrays". A single array can 
contain more than 100, 500 or even 1,000 different 
probes in discrete locations. 

20 The nucleic acid sequences, according to the invention 
may be produced using such recombinant or synthetic 
means, such as for example using PCR cloning 
mechanisms which generally involve making a pair of 
primers, which may be from approximately 10 to 50 

25 nucleotides to a region of the gene which is desired 
to be cloned, bringing the primers into contact with 
mRNA, cDNA, or genomic DNA from a human cell, 
performing a polymerase chain reaction under 
conditions which brings about amplification of the 

30 desired region, isolating the amplified region or 

fragment and recovering the amplified DNA. Generally, 
such techniques are well known in the art, such as 
described in Sambrook et al. (Molecular Cloning: a 
Laboratory Manual, 1989) . 

35 

The nucleic acids or oligonucleotides according to the 
invention may carry a revealing label- Suitable 
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labels include radioisotopes such as 32 P or 35 S, enzyme 
labels or other protein labels such as biotin or 
fluorescent markers. Such labels may be added to the 
nucleic acids or oligonucleotides of the invention and 
may be detected using known techniques per se. 

Advantageously, human allelic variants or 
polymorphisms of the DNA molecule according to the 
invention may be identified by, for example, probing 
cDNA or genomic libraries from a range of individuals, 
for example, from different populations. Furthermore, 
nucleic acids and probes according to the invention 
may be used to sequence genomic DNA from patients 
using techniques well known in the art, such as the 
Sanger Dideoxy chain termination method, which may, 
advantageously, ascertain any predisposition of a 
patient to certain disorders associated with a growth 
factor according to the invention. 

The protein according to the invention includes all 
possible amino acid variants encoded by the nucleic 
acid molecule according to the invention including a 
polypeptide encoded by said molecule and having 
conservative amino acid changes. Conservative amino 
acid substitution refers to a replacement of one or 
more amino acids in a protein as identified in Table 
1. Proteins or polypeptides according to the invention 
further include -variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said proteins or 
polypeptides. In this context, substantial homology 
is regarded as a sequence which has at least 70%, 
preferably 80 or 90% and preferably 95% amino acid 
homology with the proteins or polypeptides encoded by 
the nucleic acid molecules according to the invention. 
The protein according to the invention may be 
recombinant/ synthetic or naturally occurring, but is 
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preferably recombinant. 

The nucleic acid or protein according to the invention 
may be used as a medicament or in the preparation of a 
5 medicament for treating cancer or other diseases or 
conditions associated with expression of VEGF-X 
protein. 

Advantageously, .the nucleic acid molecule or the 
10 protein according to the invention may be provided in 
a pharmaceutical composition together with a 
pharmacologically acceptable carrier, diluent or 
. excipient therefor. 

15 The present invention is further directed to 

inhibiting VEGF-X in vivo by the use of antisense 
technology. Antisense technology can be used to 
control gene expression through triple-helix formation 
of antisense DNA or RNA, both of which methods are 

20 based on binding of a polynucleotide to DNA or RNA- 
For example, the 5' coding portion or the mature DNA 
sequence, which encodes for the protein of the present 
invention, is used to design an antisense RNA 
oligonucleotide of from 10 to 50 base pairs in length. 

25 A DNA oligonucleotide is designed to be complementary 
to a region of the gene involved in transcription 
(triple-helix - see Lee et al. Nucl. Acids Res., 
6:3073 (1979); Cooney et al., Science, 241:456 (1988); 
and Dervan et al., Science, 251: 1360 (1991), thereby 

30 preventing transcription and the production of VEGF-X. 
The antisense RNA oligonucleotide hybridises to the 
mRNA in vivo and blocks translation of an mRNA 
molecule into the VEGF-X protein (antisense - Okano, 
J. Neurochenw, 56:560 (1991)/ Oligodeoxynucleotides as 

35 Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). 
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Alternatively, the oligonucleotide described above can 
be delivered to cells by procedures in the art such 
that the anti-sense RNA and DNA may be expressed in 
vivo to inhibit production of VEGF-X in the manner 
described above. 

Antisense constructs to VEGF-X, therefore, may inhibit 
the angiogenic activity of VEGF-X and prevent the 
further growth of or even regress solid tumours, since 
angiogenesis and neovascularization are essential 
steps in solid tumour growth. These antisense 
constructs may also be used to treat rheumatoid 
arthritis, psoriasis and diabetic retinopathy which 
are all characterized by abnormal angiogenesis. 

A further aspect of the invention provides a host cell 
or organism, transformed or transfected with an 
expression vector according to the invention. The 
host cell or organism may advantageously be used in a 
method of producing VEGF-X, which comprises recovering 
any expressed VEGF-X from the host or organism 
transformed or transfected with the expression vector. 

According to a further aspect of the invention there 
is also provided a transgenic cell, tissue or organism 
comprising a transgene capable of expressing VEGF-X 
protein according to the invention. The term 
v transgene capable of expression" as used herein means 
a suitable nucleic acid sequence which leads to 
expression of VEGF-X or proteins having the same 
function and/or activity. The transgene, may include, 
for example, genomic nucleic acid isolated from human 
cells or synthetic nucleic acid, including DNA 
integrated into the genome or in an extrachromosomal 
state. Preferably, the transgene comprises the 
nucleic acid sequence encoding the proteins according 
to the invention as described herein, or a functional 
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fragment of said nucleic acid. A functional fragment 
of said nucleic acid should be taken to mean a 
fragment of the gene comprising said nucleic .acid 
coding for the proteins according to the invention or 
a functional equivalent, derivative or a non- 
functional derivative such as a dominant negative 
mutant, or bioprecursor of said proteins. For 
example, it would be readily apparent to persons 
skilled in the art that nucleotide substitutions or 
deletions may be used using routine techniques, which 
do not affect the protein sequence encoded by said 
nucleic acid, or which encode a functional protein 
according to the invention. 

VEGF-X protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecursor of said protein also forms part of the 
present invention. 

Antibodies to the protein or polypeptide of the 
present invention may, advantageously, be prepared by 
techniques which are known in the art. For example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse or rabbit, with the 
polypeptide according to the invention or an epitope 
thereof and recovering immune serum. Monoclonal 
antibodies may be prepared according to known 
techniques such as described by Kohler R. and Milstein 
C, Nature (1975) 256, 495-497. Advantageously, such 
antibodies may be included in a kit for identifying 
VEGF-X in a sample, together with means for contacting 
the antibody with the sample. 

Advantageously, the antibody according to the 
invention may also be used as a medicament or in the 
preparation of a medicament for treating tumours or 
other diseases associated with expression of VEGF-X. 
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The invention also further provides a pharmaceutical 
composition comprising said antibody together with a 
pharmaceutical^ acceptable carrier diluent or 
excipient therefor. 

5 

Proteins which interact with the polypeptide of the 
invention may be identified by investigating protein- 
interactions using the two-hybrid vector system first 
proposed by Chien et ai., (1991) Proc. Natl. Acad. 
10 Sci. USA 88 : 9578-9582. 

This technique is based on functional reconstitution 
in vivo of a transcription factor which activates a 
reporter gene. More particularly the technique 

15 comprises providing an appropriate host cell with a 
DNA construct comprising a reporter gene under the 
control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 
domain, expressing in the host cell a first hybrid DNA 

20 sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
domain of the transcription factor, expressing in the 
host at least one second hybrid DNA sequence, such as 

25 a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 
detecting any binding of the proteins to be 

30 investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
product in the host cell; optionally isolating second 
hybrid DNA sequences encoding the binding protein. 

35 An example of such a technique utilises the GAL 4 

protein in yeast. GAL4 is a transcriptional activator 
of galactose metabolism in yeast and has a separate 
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domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
binding domain. Nucleotide vectors may be 
constructed, one of which comprises the nucleotide 
residues encoding the DNA binding domain of GAL4. 
These binding domain residues may be fused to a known 
protein encoding sequence, such as for example, the 
nucleic acids according to the invention. The other 
vector comprises the residues encoding the protein 
binding domain of GAL4. These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 
according to the invention and the protein to be 
tested leads to transcriptional activation of a 
reporter molecule in a GAL -4 transcription deficient 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 
p-galactosidase is activated upon restoration of 
transcription of the yeast galactose metabolism genes. 

A further aspect of the present invention also 
provides a method of identifying VEGF-X in a sample, 
which method comprises contacting said sample with an 
antibody according to the invention and monitoring for 
any binding of any proteins to said antibody. A kit 
for identifying the presence of VEGF-X in a sample is 
also provided comprising an antibody according to the 
invention and means for contacting said antibody with 
said sample. 

VEGF-X may be recovered and purified from recombinant 
cell cultures by methods known in the art, including 
ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, • 
phosphocellulose chromatography, hydrophobic 
interaction chromatography, -affinity chromatography, 
hydroxyapatite chromatography and lectin 
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chromatography . 

The VEGF-X protein of the present invention may be a 
naturally purified product, or a product of chemical 
5 synthetic procedures, or produced by recombinant 

techniques from a prokaryotic or eukaryotic host (for 
example, by bacterial yeast, higher plant, insect and 
mammalian cells in culture) . Depending upon the host 
employed in a recombinant production procedure, the 
10 polypeptides of the present invention may be 

glycosylated with mammalian or other eukaryotic 
carbohydrates or may be non-glycosylajted, 

VEGF-X is particularly advantageous as a wound healing 
IS agent, where, for example, it is necessary to re- 

vascularize damaged tissues, or where new capillary 
angiogenesis is important. Accordingly, VEGF-X may be 
used for treatment of various types of wounds such as 
for example, dermal ulcers, including pressure sores,. 
20 venous ulcers, and diabetic ulcers* In addition, it 
can be used in the treatment of full-thickness burns 
and injuries where angiogenesis is desired to prepare 
the burn in injured sites for a skin graft and flap. 
In this case, VEGF-X or the nucleic acid encoding it 
25 may be applied directly to the wound. VEGF-X may be 
used in plastic surgery when reconstruction is 
required following a burn, other trauma, or even for 
cosmetic purposes. 

30 An important application of VEGF-X is to induce the 
growth of damaged bone, periodontium or ligament 
tissue. For example, it may be used in periodontal 
disease where VEGF-X is applied to the roots of the 
diseased teeth, leading to the formation of new bone 

35 and cementum with collagen fibre ingrowths. It can be 
used for regenerating supporting tissues of teeth, 
including alveolar bone, cementum and periodontal 
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ligament , that have been damaged by disease and 
trauma. 

Since angiogenesi's is important in keeping wounds 
5 clean and non-infected, VEGF-X may be used in 

association with surgery and following the repair of 
cuts. It should be particularly useful in the 
treatment of abdominal wounds where there is a high 
risk of infection. 

10 

VEGF-X can also be used for the promotion of 
endothelialization in vascular graft surgery. In the 
case of vascular grafts using either transplanted or 
synthetic material, VEGF-X may be applied to the 

15 surface of the graft or at the junction to promote the 
growth of the vascular endothelial cells. One 
derivation of this is that VEGF-X can be used to 
repair the damage of myocardial and other occasions 
where coronary bypass surgery is needed by stimulating 

20 the growth of the transplanted tissue. Related to 
this is the use of VEGFX to repair the cardiac 
vascular system after ischemia. 

The protein of the present invention may also be 
25 employed in accordance with the present invention by 
expression of such protein in vivo, which is often 
referred to as v gene therapy". 

Thus, for example, cells such as bone marrow cells may 
30 be engineered with a polynucleotide (DNA or RNA) 

encoding for the protein ex vivo as defined herein, 
the engineered cells are then provided to a patient to 
be treated with the polypeptide. Such methods are 
well-known in the art. For example, cells may be 
35 engineered by procedures known in the art by use of a 
retroviral particle containing RNA encoding for the 
protein of the present invention. 
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Similarly, cells may be engineered in vivo for 
expression of the protein in vivo, for example, by 
procedures known in the art. 

5 A further aspect of the invention comprises a method 
of treating a disorder mediated by expression of a 
protein according to the invention , by administering 
to a patient an amount of an antisense molecule as 
described herein, in sufficient concentration to 
10 alleviate or reduce the symptoms of said disorder. 

Compounds which inhibit or enhance angiogenesis may be 
identified by providing a host cell or organism 
according to the invention or a transgenic cell, 

15 tissue or organism according to the invention, 

contacting a test compound with said cell, tissue or 
organism and monitoring for the effect of said 
compound compared to a cell tissue or organism which 
has not been contacted with said compound. These 

20 compounds may themselves be used as a medicament or 

included in a pharmaceutical composition for treatment 
of disorders mediated by inappropriate vascularisation 
or angiogenic activity. 

25 The present inventors have also, advantageously, 

identified in the sequence encoding the VEGF-X protein 
a CUB domain, which has heretofore not previously been 
identified in VEGF-type growth factors. The VEGF-X 
protein may therefore exert dual regulatory effects 

30 via interaction with the VEGF tyrosine kinase 

receptors or with neuropilin receptors mediated by the 
CUB domain. Thus, the sequence encoding said CUB 
domain may be included in an expression vector for 
subsequent transformation of a host cell, tissue or 

35 organism. 

VEGF-X or fragments thereof may be able to modulate 



- 19 - 



the effects of pro-angiogenic growth factors such as 
VEGF as indicated in the findings presented in the 
examples below that the N-terminal part of the VEGF-X 
protein, a CUB-like domain, is able to inhibit VEGF- 
stimulated proliferation of HUVECs. VEGF-X or 
fragments thereof may therefore be useful in therapy 
of conditions involving inappropriate angiogenesis. 
Inhibition of the angiogenic activity of VEGF has 
been linked with inhibition of tumour growth in 
several models eg Kim K. J. et al, Nature 362:841- 
844, (1993), Additionally, agents able to inhibit 
angiogenesis would be expected to be useful in 
treating other angiogenesis-dependent diseases such a 
retinopathy, osteoarthritis and psoriasis (Foltanan, 
J., Nature Medicine 1:27-31, (1995). 

As identified in more detail in the Examples 
described herein the present inventors have 
surprisingly identified that the CUB domain of VEGF-X 
is able to inhibit stimulation of proliferation of 
HUVECs induced by either VEGF or bFGF. The CUB domain 
may, therefore, be utilised as a therapuetic agent 
for inhibition of angiogenesis and for treatment of 
condition associated with inappropriate 
vascularisation or angiogenesis. 

Therefore according to a further aspect of the 
invention there is provided a method of inhibiting 
angiogenic activity and inappropriate vascularisation 
including formation and proliferation of new blood 
vessels, growth and development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 
an amount of a polypeptide having an amino acid 
sequence from position 40 to 150 of the sequence 
illustrated in Figure 10 or a nucleic acid molecule 
encoding the CUB domain according to the invention in 
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sufficient concentration to reduce or prevent said 
angiogenic activity. 

Furthermore there is also provided a method of 
5 treating or preventing any of cancer, rheumatoid 

arthritis, psoriasis and diabetic retinopathy, said 
method comprising administering to said subject an 
amount of a polypeptide having an amino acid sequence 
from position 40 to 150 of the sequence illustrated 
10 in Figure 10 or a nucleic acid molecule encoding the 
CUB domain according to the invention in sufficient 
concentration to treat or prevent said disorders. 

The CUB domain may also be used to identify compounds 
15 that inhibit or enhance angiogenic activity such as 

inappropriate vascularisation, in a method comprising 
contacting a cell expressing a VEGF receptor and/or a 
neuropilin 1 or 2 type receptor with said compound in 
the presence of a VEGF-X protein according to the 
20 invention and monitoring for the effect of said 

compound or said cell when compared to a cell which 
has not been contacted with said compound. Such 
compounds may then be used as appropriate to prevent 
or inhibit angiogenic activity to treat the disorders 
25 or conditions described herein, or in a 

pharmaceutical composition. An antibody to said CUB 
domain may also be useful in identifying other 
proteins having said sequences. 
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Deposited Plasmids 

Date of Deposit Amission No. 

Plasmid VEGFX/pCR2.1 
5 1T0P0 FL 1 March 1999 LMBP 3925 

Plasmid VEGFX/pRSETB BD 

amino acids 1 March 1999 LMBP 3926 

10 G230-G345 

Plasmid VEGFX/pcR, 2.1 

FL Clone 9 20 October 1999 LMBP 3977 

IS Plasmid VEGF-X CUB 

PET22b 20 December 1999 

The above plasmids were deposited at the Belgian 
Coordinated Collections of Microorganisms (BCCM) at 
20 Laboratorium Voor Moleculaire Biologie- 

Plasmidencollectie (LMBP) B-9000, Ghent, Belgium, in 
accordance with the provisions of the Budapest Treaty 
of 28 April 1977. 

25 The invention may be more clearly understood with 
reference to the accompanying example, which is 
purely exemplary, with reference to the accompanying 
drawings, wherein: 

30 Figure 1: is a DNA sequence identified in the 

Incyte LifeSeq™ database coding for a 
novel VEGF-X protein. 



35 



Figure 2: 



is an illustration of amino acid 
sequence of the nucleic acid sequence 
of Figure 1. 
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Figure 3; 



is an illustration of PCR primer 
sequences utilised to identify the 
VEGF-X protein according to the 
invention. 



Figure 4 ; 



10 



is a diagrammatic illustration of the 
spatial relationships in the VEGF-X 
sequence of the clones identified 
using the PCR primer sequences of 
Figure 3. 



Figure 5: 



15 



is an illustration of the nucleotide 
sequences of the 5' RACE primers used 
to identify the 5' end of the VEGF-X 
open reading frame. 



Figure 6: is an illustration of the sequence 

obtained from the RACE experiment. 

20 Figure 7; is an illustration of the nucleotide 

sequences obtained from the search of 
LifeSeq™ database using the sequence 
in Figure 6. 



25 



Figure 8; 



is an illustration of the primers used 
to clone the entire coding sequence of 
VEGF-X. 



30 



Figure 9: is an illustration of the entire 

coding sequence of VEGF-X, 
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Figure 10: is an illustration of the predicted 

amino acid sequence of the nucleotide 
sequence of Figure 9- 

Figure 11: is an alignment of the sequence of 
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Figure 13: 

10 



15 

Figure 14: 

20 

Figure 15: 

25 

Figure 16: 



30 

Figure 17: 
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Figure 10 with the sequences of VEGF-A 
to D. 

is an illustration of variant 
sequences of the VEGF-X protein 
according to the invention, 

is an illustration of the 
oligonucleotide primers used for 
E.coli expression of VEGF-X domains 
and for expression of the full length 
sequence of VEGF-X in a 
baculovirus/insect cell expression 
system- 
depicts nucleic acid sequences of 18 
human EST clones obtained from a BLAST 
search of the LifeSeq™ database used 
to identify the full sequence encoding 
VEGF-X. 

depicts the nucleotide sequences of 50 
human EST clones obtained from the 
LifeSeq™ database. 

is an illustration of nucleotide 
sequences utilised as primers to 
identify the nucleotide sequence 
encoding VEGF-X, 

is a nucleotide sequence coding for a 
partial VEGF-X protein according to 
the invention. 

is an illustration of a partial 
nucleotide sequence encoding VEGF-X 
protein according to the invention. 
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Figure 19: 

5 

Figure 20: 

10 
15 

Figure 21: 

20 

25 Figure 22: 
30 



is an illustration of a DNA and 
polypeptide sequence used for 
mammalian cell expression of VEGF-X. 
The predicted VEGF-X signal sequence 
is in lower case letters. The C- 
terminal V5 epitope and His 6 sequences 
are underlined. 

is an illustration of a DNA and 
polypeptide sequence used for 
baculovirus/insect cell expression of 
VEGF-X- In the polypeptide sequence 
the signal sequence is shown in lower 
case. The N-terminal peptide tag 
added to the predicted mature VEGF-X 
sequence is* underlined. 

is an illustration of a DNA and 
polypeptide sequence used for £. coli 
expression of VEGF-X. The polypeptide 
sequences at the N- and C- termini 
derived from the MBP fusion and His 6 
tag respectively are underlined. 

illustrates the disulphide-linked 
dimerisation of VEGF-X. Protein 
samples were analysed by SDS-PAGE. 
Prior to loading the .gel, samples were 
heated to 95°C for 5 minutes in sample 
buffer in the presence (+) or absence 
(-) of reducing agent. (A) samples 
from COS cell expression of a C- 
terminally V5/His6 peptide-tagged 
construct. The left hand panel is 
total conditioned medium, the right 
hand panel is material purified on 
Nickel agarose resin. Reduced monomer 
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Figure 23: 

20 



25 



Figure 24: 

30 



35 

Figure 25: 



- ?5 - 

and putative disulphide-linked, non- 
reduced dimer are indicated by arrows. 
There appears to be proteolysis of the 
protein during purification. Gels were 
blotted onto nylon membranes and 
protein detected with an anti V5 
monoclonal antibody. (B) Samples from 
E.coli expression of a maltose-binding 
protein/His6 dual fusion construct. M 
indicates the molecular weight markers 
(Benchmark/ Lif eTechnologies) . The 
gel was stained with Coomassie Blue by 
standard procedures. The fusion 
protein has an apparent molecular 
weight of 80kDa. 

illustrates the glycosylation of VEGF- 
X. VEGF-X was purified from the 
culture supernatant of COS cells 
transfected with the pcDNA6/V5-His 
construct. Supernatants were 
harvested 72h post-transf ection and 
purified on nickel resin. Samples 
were then treated with EndoH (+) or 
untreated (-) before SDS-PAGE and 
blotting, as described in the legend 
to Figure 22. 

is an illustration of the DNA and 
polypeptide sequence used for E* coli 
expression of the VEGF-like domain of 
VEGF-X. Polypeptide sequences at the 
N-terminus of the protein derived from 
the vector are underlined. 

shows expression of the VEGF-X VEGF 
domain in E. coli. Lane l-10pl broad 
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range marker (New England Biolabs), 
lane 2-10ul unreduced sample, lane 3- 
lOpl reduced sample. The reduced PDGF 
domain protein (lane 3) has an 
5 apparent molecular weight of 

approximately 19kDa on SDS-PAGE. 

Figure 26: illustrates a DNA and polypeptide 

sequence used for E. coli expression 
10 of the CUB-like domain of VEGF-X. The 

polypeptide sequence at the N-terminus 
derived from the vector-encoded signal 

■ - • and the introduced His 6 tag are 

underlined. 

15 

Figure 27: shows expression of the VEGF-X CUB 

domain in E. coli. The CUB domain 
protein was purified on Nickel chelate 
resin. The protein migrates at 
20 approximately 23kDa on SDS-PAGE. 

Figure 28: illustrates the effect of truncated 
VEGF-X (CUB domain) on HUVEC 
proliferation. (A) Human Umbilical 
25 Vein Endothelial Cells (one-day- 

treatment) . (B) Human Umbilical Vein 
Endothelial Cells (24-hour starving 
followed by one-day-treatment) . (C) 
Effect of VEGF-A 165 and VEGF-X CUB 
domain on the proliferation of HUVEC 
(two-day-treatment) . 



30 



Figure 29: depicts the tissue distribution of 

VEGF-X mRNA analysed by Northern 
35 blotting and RT-PCR in (A) normal 

tissues and (B) tumour tissue and cell 
lines. 



WO 00/37641 



- 27 - 



PCT/US99/30503 



Figure 30: 

5 
10 
15 

Figure 31: 

20 

Figure 32: 

25 

30 Figure 33: 
35 

Figure 34: 



depicts the partial intron/exon 
structure of the VEGF-X gene, (A) 
Genomic DNA sequences of 2 exons 
determined by sequencing; exoh 
sequence is in upper case, intron 
sequence is in lower case. (B) Shows 
the location of splice sites within 
the VEGF-X cDNA sequence. The 
location of mRNA splicing events is 
indicated by vertical lines. The 
cryptic splice donor/acceptor site at 
nt. 998/999 (diagonal lines) gives 
rise to the splice variant forms of 
VEGF-X • No splice site information is 
given for the region shown in italics. 

is a graphic representation of the 
effect of FL- VEGF-X on HuVEC 
proliferation: (24 hour serum 
starvation followed by one day 
treatment) . 

is a graphic representation of the 
combined effect of truncated VEGF-X 
(CUB domain) and human recombinant 
VEGF 165 on HuVEC proliferation: (24 hour 
serum starvation followed by two day 
treatment) . 

is a graphic representation of the 
combined effect of the CUB domain and 
human recombinant bFGF on HuVEC 
proliferation: (24 hour serum 
starvation followed by two day 
treatment) . 

is a graphic representation of the 
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results of a LDH assay for testing 
cytotoxicity of the CUB domain or the 
CUB domain with rhVEGF 165 . 

Figure 35: is a graphic representation of the 

results obtained from a LDH assay for 
testing cytotoxicity of the CUB domain 
or CUB domain with rh-bFGF. 

A BLAST (Basic Local Alignment Search Tool; Altschul 
et al., 1990 J- Mol. Biol. 215, 403-410) search was 
performed in the proprietary- LifeSeq™ human EST 
database (Incyte Pharmaceuticals/ Inc., Palo Alto, 
CA, USA) . BLAST produces alignments of both 
nucleotide and amino acid sequences to determine 
sequence similarity. Because of the local nature of 
the alignments, BLAST is especially useful in 
determining exact matches or in identifying 
homologues. While it is useful for matches which do 
not contain gaps, it is inappropriate for performing 
motif-style searching. The. fundamental unit of BLAST 
algorithm output is the High-scoring Segment Pair 
(HSP) . 

Eighteen human EST clones (Figure 14) with high 
similarity to the previously identified VEGF proteins 
were identified and a further fifty EST clones 
(Figure 15) were identified using these sequences as 
query sequences, allowing us to deduce the putative 
sequence for the new VEGF-X protein. The sequences 
obtained were compared to known sequences to 
determine regions of homology and to identify the 
sequence as a novel VEGF-type protein. Using the DNA 
sequence information in the databases we were able to 
prepare suitable primers having the sequences of 
VEGF-X 1-10 illustrated in Figure 3 for use in 
subsequent RACE experiments to obtain the complete 
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DNA sequence for the VEGF-X gene. 
Cloning 

A profile was developed based on the VEGF-like domain, 
in existing VEGF sequences (VEGF-A, B, C and D) . 
This was used to search the public databases and the 
Incyte LifeSeq™ database. No significant novel 
matching sequences were found in the public 
databases. All of the matching .sequences found in 
the LifeSeq™ database (-1000) were assembled to give 
a smaller number of sequences (-30) , which included 
the known VEGFs and a potential novel VEGF (figures 
1 and 2) . This sequence was named VEGF-X . 

Oligonucleotides were designed to amplify the VEGF-X 
sequence from cDNA (figure 3) . The ESTs found in 
LifeSeq™ were from a range of tissues, with a slight 
predominance of sequences from ovary, testis, 
placenta and lung (Figure 14 and 15) . Accordingly 
the oligonucleotides were used to amplify cDNA 
derived from lung and placenta. First-round PCR 
products were found at ~200bp larger than the 
expected sizes, while 3 major species appeared after 
a second round of PCR amplification, the smallest of 
which was of the expected size. These fragments were 
cloned and sequenced. The smallest fragment did 
indeed have the sequence originally identified from 
the LifeSeq database, while the others contained 
insertions (figure 4) . 

As. the first round of amplification suggested that 
the major species found in cDNA from ovary and 
placenta was not that originally identified in the 
LifeSeq™ database, the focus of effort was switched 
to the presumed major species (it seemed likely that 



clones 57, 25-27 and 2.1Jcb clones 1-3 in fig 4 
represented the major mRNA species). Conceptual, 
translation of the DNA sequences of these cloned PGR 
fragments indicated that the complete open reading 
frame was not present in the clones or in the 
sequence from LifeSeq™. While all clones contained 
the same sequence in the region of the translation 
termination codon, indicating that the end of the 
open reading frame had been identified, the 5' end of 
the open reading frame had not been cloned. 5' RACE 
experiments were therefore carried out in order to 
find the start of the reading frame. PCR primers 
designed for RACE experiments are shown in figure 5. 
RACE PCR products were sequenced directly. Sequence 
could be obtained from the 3' end of these RACE 
products but not from the 5' end; probably because 
the products were not cloned and were therefore 
heterogeneous at the 5' end. This new sequence was 
assembled with the existing cloned sequence to give 
the sequence shown in figure 6. Searching the 
LifeSeq™ database with this sequence identifies ESTs 
which extend the sequence a further 140bp in the 5' 
direction and a further 160bp in the 3' direction 
(figure 7) . This longer contig was used to design 
oligonucleotide primers to amplify the entire coding 
sequence (these primer sequences are shown in figure 
8). PCR was carried out using primers 5 f -l and 
vegfXIO (in order to clone a "full-length" cDNA) , and 
with primers 5'-l and vegfX6 (in order to clone the 
full coding region/ see figure 3 for sequences of 
vegfXIO and vegfX6) . A number of clones were 
obtained for the shorter fragment, of which clones 4 
and 7 contain no PCR errors (sequence of clones 4 & 7 
in figure 9) . A single clone was obtained for the 
longer fragment (clone 9) , but this sequence appears 
to contain 2 PCR errors. 
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The predicted polypeptide from these longer contigs 
is shown in figure 10. Amino acids 1-22 are 
predicted to encode a signal sequence (von Heijne, 
1986, Nucleic Acids Res. 14, 4683-4690). Figure 11 
shows an alignment of the protein sequence with VEGFs 
A-D. The region homologous to the other VEGFs is 
located towards the C-terminus of the protein. As 
the VEGF homology domain is expected to belong to the 
TGF-beta superfamily of growth factors and to consist 
of a dimer containing both intra- and intermolecular 
disulphide bonds, initial alignments focussed on the 
cysteines. However, mapping of the sequence onto the 
known x-ray structure of the VEGF-A receptor-binding 
domain (Muller et al (1997) Proc. Natl. Acad. Sci USA 
94, 7192-7197) suggests that the alignment in figure 
11 is plausible, as the extra 4 cysteine residues 
within the VEGF-homology region of VEGF-X (compared 
to this region of VEGF-A) correspond to residues 
which are spatially close in VEGF-A, and may 
therefore be able to form disulphide bonds. 

A search of the PFAM database of protein domains with 
the full-length polypeptide sequence from figure 10 
identifies two domain consensus sequences within the 
polypeptide. The more C-terminal domain is a "VEGF" 
domain: (the known VEGFs all contain this domain and 
the structure of this region of VEGF-A is similar to 
that of PDGF) . Additionally towards the N-terminus 
of the polypeptide there is a CUB domain (amino acids 
-40-150) • The CUB domain is a 100-110 amino acid 
extracellular domain found in a number of 
developmentally- regulated proteins. When the full- 
length protein is used to search the protein 
databases using the BLAST 2 algorithm, the scores for 
matches to CUB domain-containing proteins are more 
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significant than those to the other VEGFs. 
Interestingly, the most significant matches are to 
the CUB domains of Neuropilins, and Neuropilin-1 was 
recently identified as a receptor of one of the VEGF- 
A isoforms VEGF-A U5 (Soker et al. (1998) Cell 92, 735- 
745) . 

Assuming that the variant sequences isolated by PGR 
(i.e. the smaller PGR fragments) use the same 
translation initiation site as the full-length 
sequence, they would result in production of the 
variant proteins shown in figure. 12. It may be 
significant that both of these variant proteins 
retain the CUB domain and delete all or part of the 
VEGF-like domain. The production of these variant 
sequences can be explained by the use of a cryptic 
splice donor/acceptor site within the VEGF-X sequence 
(figure 30B, between nt. 993/999): one variant arises 
by splicing out of the region between nt. 729-998, 
the other by splicing out of the region between nt. 
999-1187. 



Expression 



Full-length expression constructs 

Mammalian cells 

Clone 4 containing the full CDS of VEGF-X (see figure 
9) , was used to generate constructs for expression of 
full-length protein. The sequence was amplified by 
PCR and cloned into the vector pCDNA6/V5-His so as to 
add a C-terminal V5 epitope tag and His 6 tag. The DNA 
and polypeptide sequence in this vector is shown in 
figure 19. Transient expression in COS cells 
followed by western blotting and detection via an . 
anti-V5 mAb demonstrates the secretion of a protein 
of -50K into the medium in transfected cells only 
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(figure 22A) . This construct can also be used to 
generate VEGF-X expressing stable CHO cell lines. 

Baculovirus/Insect-cell expression system 
For expression in the baculovirus/insect cell system 
the DNA encoding the predicted mature VEGF-X 
polypeptide sequence was fused to a sequence encoding 
a signal derived from melittin, a secreted insect 
protein. An N-terminal 6His tag was also added to 
facilitate purification. The insert was then cloned 
into the baculovirus expression vector pFASTBAC. The 
DNA and polypeptide sequence of this construct is 
shown in figure 20. Infection of Trichoplvsia ni Hi5 
cells with this recombinant baculovirus results in 
the secretion of a protein of approximately 45K into 
the medium (data not shown) . 

E. coli 

The coding region of VEGF-X has been cloned in a 
variety of ways for expression as a secreted protein 
in E.coli. A particularly useful expression clone 
carries an N-terminal fusion to the E.coli 
maltose-binding protein (MBP- derived from the 
expression vector pMAL-p2, New England Biolabs) and a 
C- terminal fusion to a 6His tag. The DNA and 
polypeptide sequence of this vector is shown in 
figure 21. Sequential purification of cell fractions 
on Ni-NTA resin and amylose resin allows the 
isolation of the expressed protein (see figure 22B) - 

Expression of fragments 

VEGF 

The VEGF domain of VEGF-X has been expressed in 
E.coli. Similar domains from VEGF-A (Christinger et 
al. (1996) PROTEINS: Structure, function and Genetics 
26, 353-357), and VEGF-D (Achen et al (1998) Proc. 
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Natl. Acad.Sci USA 95, 548-553) have been shown to 
be capable of binding to the respective receptors. 
Expression of these domains was carried out using the 
bacterium E.coli. Additionally, the full-length 
protein*. was expressed using the baculovirus/insect 
cell expression system. The oligonucleotide primers 
which have been obtained for these experiments are 
shown in figure. 13, The construct directed 
expression in the bacterial cytoplasm, and as 
expected the protein was produced in insoluble form 
in inclusion bodies (the DNA and polypeptide sequence 
used for PDGF domain expression is "shown in figure 
-24). Inclusion bodies were washed, solubilized with 
urea and the protein purified under denaturing 
conditions, before refolding by dialysis to remove 
the urea. Soluble protein was obtained, but shows 
little evidence of the disulphide bond linked dimers 
seen with material derived from animal cells (figure 
25, compare with figure 22A & B) . It is not clear 
therefore whether this protein is correctly folded. 

CUB 

The CUB domain has been expressed as a soluble 
secreted protein in E.coli (figure 26) . The protein 
was purified by binding to Ni-NTA resin (figure 27) 
and assayed for activity on KUVECs in an in-vitro 
proliferation assay. 

Properties of the VEGF-X protein 
The transient mammalian cell expression system 
described above has been used to generate full-length 
VEGF-X protein, as shown by antibody detection 
following Western blotting (see figure 22A) . 



35 



Disulphide bond linked dimer* 

The other members of the PDGF family of growth 
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factors, the PDGFs and VEGFs, all exist as dimers in 
which two monomers constituting the dimer are linked 
by interchain disulphide bonds. The x-ray structures 
of PDGF-BB (Oefner et al, 1992) , and VEGF-A (Muller 
5 et al, 1997) are known and indicate that at least 
these two members of the family contain two 
interchain disulphide bonds. Practically this means 
that in SDS-PAGE analysis of these growth factors the 
presence of interchain disulphide bonds is shown by a 

10 large decrease in mobility in the absence of reducing 
agent (ie. the nonreduced dimer migrates more slowly 
through the gel than the -reduced monomer) . This 
effect was also expected for VEGF-X, and. has been 
demonstrated for the material obtained from transient 

15 mammalian cell expression (figure 22A) . In the case 
of the full length material produced in E. coli only 
some 10% of the total VEGF-X protein appears to be 
present as disulphide bond-linked dimers (figure 
22B) . However, these results provide evidence that" 

20 the mammalian cell-derived protein is correctly 
folded, and that a portion of the E. co2i-derived 
protein is too. 

Glvcosylation 

25 There are 3 predicted potential N-linked 

glycosylation sites within the VEGF-X protein: at 
residues 25, 55 and 254 of the polypeptide sequence. 
The predicted molecular mass of the mature VEGF-X 
protein is 40kDa, but SDS-PAGE and western blotting 

30 (detection via an introduced C-terminal epitope tag- 
see figure 19) of the full-length protein expressed 
in COS cells gives a band slightly larger than the 
expected size (45-50kDa) as well as one at 25kDa 
(figure 22A) . This smaller band is presumed to be a 

35 C-terminal proteolysis fragment derived from the 

full-length molecule (controls from uninfected cells 
do not show this band) , probably corresponding to a 
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cleavage between the CUB and VEGF domains. EndoH 
treatment of the preparation gives a slight mobility 
change for the full-length protein (figure 23) , but 
for the smaller VEGF domain fragment there is a clear 
5 change , indicating that the predicted glycosylation m 
site within the VEGF domain at residue 254 is indeed 
glycosylated. 

Activity of proteins In cell-based assays 
10 Protein samples were tested for activity in cell 
proliferation, cell migration and in-vitro 
angiogenesis assays. Active samples can also be 
tested in the in vivo matrigel mouse model of 
angiogenesis, 

15 

Full-length VEGF-X protein 

Conditioned medium derived from COS cells transiently 
expressing VEGF-X (see figure 22A) displayed no 
detectable activity in any of the assays. However, 

20 as VEGF-X protein could only be detected in this 
preparation by Western blotting, and not by 
Coomassie-staining of gels, it is clearly present at 
very low levels and this may be the reason for the 
observed lack of activity in the cell proliferation, 

25 migration or in vitro angiogenesis tests, 

VEGF domain 

The VEGF domain protein described above has been 
tested in cell proliferation (on a range of cell 
30 types), cell migration and in vitro angiogenesis 
assays and has failed to show activity in any of 
these tests. As suggested above, this may be due to 
incorrect folding of this protein. 



35 



CUB domain 

The CUB domain protein at the highest dose tested 
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(lpg/ml) appears to inhibit proliferation of HUVBCs 
in the absence of other stimulation (figure 28 A & B) . 
This effect is also seen following stimulation with 
the lowest VEGF-A 165 dose tested (lng/ml- figure 28C) • 
5 The CUB domain of VEGF-X therefore appears to show . 
antiproliferative activity on HUVECs, even in the 
presence of low VEGF-A 165 doses . 

Tissue distribution of mRNA 

10 VEGF-A mRNA expression has been shown to be 

upregulated in a wide variety of human tumors (lung, 
breast, ovarian, colon, stomach, liver, pancreas, 
kidney, bladder and prostate- Takahashi et al, 1995) . 
Tumor VEGF-A expression has been shown to correlate 

15 with tumor growth rate, microvascular density and 
tumor metastasis (Takahashi et al, 1995) . It was 
thus of interest to examine the mRNA expression 
patterns of VEGF-X. Accordingly, Northern blot 
analysis of mRNA derived from different tissues has 

20 been carried out. The results indicate that although 
the VEGF-X mRNA is expressed at low levels, it is 
present in a wide range of tissues. PCR 
amplification of cDNA from a range of tissue sources 
supports this idea (figure 29A) . The major mRNA 

25 species is approximately 3.1kb in size. There is 

no significant upregulation seen in tumour cell lines 
or in tumour tissues tested (figure 29B) , with the 
possible exception of the cell lines GI-117 (lung 
carcinoma) and SaOS-2 (osteosarcoma) . The results of 

30 these initial tissue distribution studies do not, 
therefore, provide evidence for upregulation of 
VEGF-X in tumour growth, as is seen with VEGF-A. 

Genomic structure of the VEGF-X gene 
35 A genomic BAC clone covering the 3' part of the 

VEGF-X locus was isolated by hybridisation screening 
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of nylon filters containing a human BAC library. 
Direct sequencing of this clone using oligonucleotide 
primers based on the VEGF-X cDNA sequence allowed the 
determination of several intron/exon boundaries 
5 (figure 30). Interestingly, the position of the mRNA 
splice site within the PDGF domain (nt 1187/1188 in 
figure 30B) is conserved with respect to those in the 
VEGF-A and VEGF-D genes (Tischer et al, 1991; 
Rocchigiani et al, 1998). 

10 

Materials & Methods 



- PCR, Cloning, DNA sequence determination and BAC 
screening. 

15 All primers were purchased from Eurogentec, Seraing, 
Belgium. Insert-specific sequencing primers (15- and 16- 
mers) were designed by visual inspection of the DNA 
sequences. DNA was prepared on Qiagen-tip-20 columns or 
on Qiaquick spin columns (Qiagen GmbH, Diisseldorf, 

20 Germany) and recovered from the spin columns in 30^1 
Tris/EDTA-buffer (lOmM TrisHCl pH 7,5, 1 mM EDTA (sodium 
salt) ) . Sequencing reactions were performed using 
BigDye™ Terminator Cycle Sequencing Ready Reaction kits 
(Perkin Elmer, ABI Division, Foster City, CA, USA) and 

25 were run on an Applied Biosystems 377 DNA sequencer 
(Perkin Elmer, ABI Division, Foster City, CA, USA) . 
Polymerase chain reactions were carried out according 
to standard procedures (Ausubel et al, 1997). The 
PGR fragments were cloned into vectors pCR2>l 

30 (Invitrogen, Carlsbad, CA* USA) or pCR-TOPO 

(Invitrogen,NL) according to the manufacturer's 
instructions. One of those vectors, plasmid 
VEGFX/pCR2.1 1T0P0 FL 

was deposited on 1 March 1999 under Accession No. 
35 LMBP 3925. After sequence determination, the inserts 
were cloned into the desired expression vectors (see 
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figures 19, 20, 21, 24 & 26) . 

A human genomic BAC library (Genome Systems, Inc., St 
Louis, MI, USA) was screened by hybridisation to 
5 oligonucleotides derived from the VEGF-X cDNA 
sequence, according to the manufacturer's 
instructions, BAC DNA was prepared using a Qiagen 
plasmid midi kit (Qiagen GmbH, Diisseldorf , Germany ) 
according to the manufacturer's instructions with 

10 some modifications (after clearing of the lysate from 
chromosomal DNA, supernatants from individual 
preparations were pooled on a single column (tip 
100), and after the. 70 % EtOH wash, the pellet was 
resuspended overnight at 4°C in 100 ul TE) . 20-mer 

IS sequencing primers were designed based on the known 
cDNA sequence, and sequencing carried out as above. 

5' RACE 

20 In order to extend the cDNA clone in a 5' direction 
RACE reactions were carried out. Since it was known 
that the mRNA is present in placenta and skeletal 
muscle, Marathon-Ready™ placenta and skeletal muscle 
cDNAs were purchased from Clontech (Palo Alto CA. 

25 USA) and used according to the manufacturer's 
instructions. DNA fragments were excised from 
agarose gels, purified using QiaQuick PCR 
purification columns (Qiagen GmbH, Diisseldorf, 
Germany) and sequenced directly. 

30 

VEGF-X protein expression and purification 
DNA fragments encoding the desired protein sequences 
were amplified by PCR and cloned into appropriate 
expression vector systems. 

35 

For mammalian cell expression, the full coding 
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sequence was cloned into the vector pcDNA6/V5-his 
(Invitrogen Leek/ NL, see figure 19 for construct 
sequence) , so as to add a C-terminal peptide tag to 
assist in detection and purification. 

5 

For insect cell expression the sequence of the 
predicted mature polypeptide was initially amplified 
to add an N-terminal 6His peptide and then cloned 
into the pMelBacB vector (Invitrogen, Leek, NL) to 
10 add an insect cell signal sequence. The entire 

insert was then PCR-cloned into the vector pFASTBAC-1 
(LifeTechnologies, Gaithersburg, MA, USA) for 
construction of a baculovirus according to the 
manufacturer's instructions. 

15 

.For E.coli expression the coding region was PCR 
amplified to add a C-terminal 6His tag and then 
cloned into the vector pMAL-p2 (New England Biolabs, 
Beverly, MA, USA) . The coding sequence of this 
20 construct is shown in figure 21) , The protein was 
purified first on Ni-NTA resin (Qiagen GmbH, 
Dtisseldorf , Germany) and then on amylose resin (New 
England Biolabs, Beverly, MA, USA), according to the 
manufacturers ' instructions . 

25 

DNA sequences encoding the CUB and VEGF domain 
fragments of VEGF-X were PCR amplified and cloned 
into pET22b and pET21a (Novagen, Madison, WI, USA) 
respectively. The CUB domain protein was prepared 

30 either from the periplasm or medium of induced 

cultures by standard methods (Ausubel et al, 1997). 
The protein was initially purified by precipitation 
with 20% ammonium sulphate. After overnight dialysis 
vs 20mM Tris Hcl pH7.5 f lOOmM NaCl to remove ammonium 

35 sulphate, the protein was further purified on Ni-NTA 
resin as described above. The. VEGF domain protein 
was expressed in insoluble form, and preparation of 
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inclusion bodies was carried out using standard 
procedures (Ausubel et al 1997), Inclusion bodies 
were dissolved in 6M guanidine hydrochloride, 20mM 
Tris" Hcl pH8.0, 200mM NaCl, ImM 2-mercaptoethanol, 
5 and purified on Ni-NTA resin (Qiagen GmbH, 

Dusseldorf, Germany) according to the manufacturer's 
instructions. The protein was refolded by dialysis 
against several .changes of buffer containing 
decreasing concentrations of denaturant. 

10 

Analysis of protein glycosylation was carried out 
using EndoH (Roche Molecular Biochemicals, Brussels, 
BE) according to the manufacturer's instructions. 

15 Cell Proliferation Assay 

Human umbilical vein endothelial cells (HUVECs) 
(Clonetics, San Diego, CA.) were trypsinized with 
0.05% trypsin/0. 53mM EDTA (Gibco, Gaithersburg, MD-), 
resuspended in the EGM-2 (Clonetics, San Diego, CA. ) , 

20 counted, and distributed in a 96-well tissue culture 
plate at 5,000 cells/well. Following cell attachment 
and monolayer formation (16 hours), cells were 
stimulated with various concentrations of truncated 
VEGF-X (CUB domain or VEGF domain) or dilutions of 

25 culture supernatants of the full-length VEGF-X (COS 7 
or HEK293) in DMEM (Gibco, Gaithersburg, MD.) 
containing 0.5% to 2% FBS (HyClone, Logan, UT) as 
indicated. For human fetal dermal fibroblasts 
(American Type Culture Collection, Rockville, MD.), 

30 the growth medium was replaced by DMEM containing 
0.1% BSA (Sigma, St. Louise, MO.) with or without 
various concentrations of truncated VEGF-X proteins. 
For HCASMC (Clonetics, San Diego, CA.), the medium 
was replaced by DMEM containing 0.5% FBS. The cells 

35 were treated for a further 24 hr-72 hr. For the 

measurement of proliferation, the culture media were 
replaced with 100 pi of DMEM containing 5% FBS and 3 
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pCi/rnl of [3H] -thymidine (Amersham, Arlington 
Heights, IL.). Following pulse labeling, cells were 
fixed with methanol/acetic acid (3:1, vol/vol) for 1 
hour at room temperature . The cells were washed 
5 twice with 250 yd/well of 80% methanol. The cells 
were solubilized in 0.05% trypsin (lOOpl/well) for 
30 minutes then in 0.5% SDS (100 pl/well) for another 
30 minutes. Aliquots of cell lysates (190 were 
combined with 2 ml of scintillation cocktail (Fisher, 
10 Springfiled, NJ) and the radioactivity of cell 

lysates was measured using a liquid scintillation 
counter (Wallac 1409) . In each case, samples were 
performed in quadruplicate. 

15 Chemo taxis Assay 

The chemotactic response of HUVECs was assayed using 
a 48-well modified Boyden chamber (NeuroProbe, Cabin 
John, MD. ) and collagen-coated (O.lmg/ml type I 
collagen, Collaboratic Biomedical, Bedford, MA.) 

20 polycarbonate membrane filters with a pore diameter 
of 8 jim (NeuroProbe, Cabin John, MD.). Cell 
suspensions (15, 000/well) were loaded to the upper 
part of the chemotaxis chamber and stimulated for 4 
hours with rhVEGF 165 (0.1-10 ng/ml) (Calbiochem, San 

25 Diego, CA.) or various concentrations of truncated 
VEGF-X (PDGF domain) . Cells remaining on the top of 
the membrane were removed. Migration was assessed by 
counting the number of cells that migrated to the 
lower side "of the filter membrane. The membrane was 

30 fixed with 10% formaldehyde for 15 min, followed by 
staining with Gill's hemotoxylin III- (Poly 
Scientific, Bay Shore, NY.). The assay was performed 
in triplicates and six independent high power fields 
per well were counted using a light microscope at 250 

35 magnification. The results were expressed as the fold 
of unstimulated cells (EGM containing 0.1% BSA) . 
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Zn Vitro Angiogenesis Assay 

In vitro angicgenesis in fibrin gels was quantitated 
using spheroids of human umbilical vein endothelial 
cells (Korff et al., 1998). To generate endothelial 
5 cell spheroids of defined size and cell number/ a 
specific number of cells (- 800 cells per spheroid) 
was suspended in EGM-2 culture medium containing 20% 
methylcellulose .(Sigma, St. Louis, MO.), seeded into 
nonadherent round-bottom 96-well plates. All 
10 suspended cells in one well contributed to the 
formation of a single endothelial cell spheroid 
within 24 hours, A fibrin gel stock solution was 
prepared freshly prior to use by mixing 3mg/ml 
fibrinogen (Calbiochem, San Diego, CA.) in Medium 
15 199(Gibco f Gaithersburg, MD, ) . Assays were performed 
in 24-well culture plates. The 1ml fibrinogen stock 
was mixed with 50 HUVEC spheroids and the 
corresponding test substance including rh-VEGF 165 or 
various concentration of VEGF-X. The 
20 spheriod-containing fibrinogen was rapidly 

transferred into 24-well plates. Fifteen microliters 
of thrombin (100 NIH U/ml stock, Sigma, St. Louis, 
MO.) was added to the gel for the fibrin gel 
formation. The gel formation usually occurred within 
25 30 seconds. "After gel formation, Iml/well of Medium 
199 supplemented with 20% FBS, Img/ml c-aminocaproic 
acid (Calbiochem, San Diego, CA.) and antibiotics 
were added. The gel was incubated at 37 °C (5%C0 3 , 95% 
air, 100% humidity). After 3 days, in vitro 
30 angiogenesis was quantitated by measuring the length 
of the three longest capillary sprouts that had grown 
out of each spheroid (100X magnification), analyzing 
at least 10 spheroids per experimental group and 
experiment. 



Matrigel Mouse Assay 



The matrigel mouse assay is carried out as described 
by Passanti et al (X992) . 

Analysis of VEGF-X gene expression by RT-PCR 
analysis . 

Oligonucleotide primers VEGF-E2 and VEGF-X14 (figure 
16; figure 5) were used for the specific PGR 
amplification of- a 350 bp fragment from VEGF-X. PCR 
amplifications, were performed on human multiple 
tissue cDNA (MTC™) panels (Clontech human MTC panels 
1 and II and human Tumor MTC panel) normalised to the 
mRNA expression levels of six different housekeeping 
genes. In addition, cDNA was made from different 
tumor cell cultures (Caco-2 colorectal 
adenocarcinoma; T-84 colorectal carcinoma; MCF-7 
breast adenocarcinoma; T-47D breast ductal gland 
carcinoma; HT1080 bone fibrosarcoma; SaOS-2 
osteosarcoma; SK-N-MC neuroblastoma; HepG2 
hepatoblastoma; JURKAT T-cell leukemia and THP-1 
myelomonocytic leukemia) . For the preparation of 
tumor cell cDNA/ cells were homogenised and total RNA 
prepared using the RNeasy Mini kit (Qiagen GmbH, 
Hilden, Germany) according to manufacturer's 
instructions. 1 pg of total RNA was reverse 
transcribed using oligo(dT)15 as a primer and 50 U of 
Expand™ Reverse Transcriptase (Boehringer Mannheim, 
Mannheim, Germany) according to the manufacturer's 
instructions. PCR reactions with VEGF-X-specif ic oy 
glyceraldehyde-3-phosphate dehydrogenase 
(G3PDH) -specific primers were then performed on 1 yd 
of this cDNA. For all cDNAs, PCR reactions with 
VEGF-X specific primers were performed in a total 
volume of 50 pi, containing 5 pi (±1 ng) of cDNA, Ix 
Advantage KlenTaq PCR reaction buffer, 0.2 mM dNTP, 
250 nM of primers VEGF-E2 and VEGF-X14 and 1 \xl of 
Advantage KlenTaq polymerase mix. Samples were heated 
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to 95 °c for 30 s and cycling was done for 30 s at 
95°C and 30 s at 68°C fox 25, 30 or 35 cycles. 
Control reactions using specific primers that amplify 
a 1 kb fragment of the housekeeping gene G3PDH were 
5 also performed according to the manufacturer's 
instructions. 



Northern blot analysis of VEGF-X. 

Northern blots containing 2 yig of poly (A) -rich RNA 
derived from different human tissues (Clontech 
Laboratories; MTN™ blot, MTN™' blot II and Cancer Cell 
Line MTN™ blot) were hybridized according to the 
manufacturers instructions with a a-[ 32 P]-dCTP 
random-priming labelled (Multiprime labelling kit, 
Roche Diagnostics) 293 bp specific VEGF-X fragment 
(PlnAI-StuI fragment including 92 bp of the 3' end 
coding region and 201 bp of the 3' untranslated 
region of VEGF-X) . The blots were hybridized 
overnight at 68 °C and final washes at high stringency 
were at 68 °C in O.lx SSC/0.1 % SDS. The membranes 
were autoradiographed for 1 to 3 days with 
intensifying screens. 



Full length VEGF-X 

The effect of full length VEGF-X on proliferation of 
HuVEC cells was determined by the 3 H-Thymidine 
incorporation assay, HuVEC cells were serum starved 
for 24 hours prior to treatment with the full length 
VEGF-X at the concentration range from 100 pg/ml-10 
pg/ral. There was no effect of VEGF-X at 100 pg/ml-10 
ng/ml on endothelial cell proliferation. At the 
higher concentrations of FL-VEGF-X (100 ng/ml and 1 
pg/ml) there was a marked inhibition of endothelial 
cell proliferation. This is probably due to the very 
high endotoxin level in the samples. The VEGF-X 
sample was purified in order to decrease the 
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endotoxin level and is currently tested in the cell 
proliferation assay. 

The Summary from Testing the CUB Domain 
5 The effect of CUB domain on inhibition of HuVEC 
prolieration either serum- (2%), rh-VEGF or bFGF- 
stimulated, was assessed by the 3 H-Thymidine 
incorporation assay. Cells were serum starved 
followed by the treatment with the CUB domain and 

10 various growth factors. Results showed that the CUB 
domain inhibited endothelial cell proliferation, 
either serum- (2%), rh-VEGF or bFGF-stimulated in a. 
dose dependent manner with maximal inhibition at 10 
Ug/ml. There was approximately a 2-fold inhibition 

15 of proliferation {at 10 pg/ml) of cells stimulated 

with VEGF and bFGF and nearly a 5-fold inhibition of 
cells stimulated with serum (2%) . Results with the 
LDH assay showed that there was no cytotoxicity 
associated with the inhibition of cell proliferation 

20 by the CUB domain. 

Therefore, the N-terminus of the polypeptide from 
Figure 10 has been shown to possess a CUB domain. 
When database searches are carried out using the 

25 full-length coding sequence the best matches {i.e. 

for a BLAST search/ those with the lowest probability 
score) are found with the CUB domain rather than with 
the VEGF-like domain. The best match from searching 
release 37 of the SWISSPROT database (Feb 1999) is to 

30 the CUB domain of a neuropilin from Xenopus laevis, 
and the matches to the CUB domains of human 
neuropilins 1 and 2 are also more significant than 
matches to the VEGFs . 

35 This similarity is provocative, given the 

identification of neuropilin-1 and -2 as cellular 
receptors for the VEGF- A 165 (Stoker et al. 1998, 



WO 00/37641 



- 47 - 



PCT/US99/30503 



reviewed in Neufeld et al. 1999). It is plausible 
therefore that VEGF-X could exert dual regulatory 
effects; via interaction with the tyrosine kinase 
VEGF-receptors mediated by the VEGF-like domain, as 
5 well as via interaction with VEGF isoforms or with 
the neurophilin receptors, mediated by the CUB 
domain. 

To the best of our understanding the latter would be 
10 entirely novel, and searches on the most recent 
release of the Incyte database do not reveal any 
other proteins containing both CUB and VEGF-like 
domains. This arrangement of domains suggests 
possible positive or negative models of regulation: 

15 

Positive- the VEGF-like domain is able to interact 
productively with the tyrosine kinase VEGF receptors 
giving activation, and the CUB domain is able to 
interact productively with the neuropil in receptor 
20 giving activation. 

Negative- the VEGF-like domain does not interact 
productively with the tyrosine kinase VEGF receptors, 
either preventing receptor dimerisation or blocking 

25 the VEGF binding sites. Further, the CUB domain does 
not interact productively with the neuropilin 
receptors, either preventing receptor activation or 
blocking the VEGF binding sites, or indeed by binding 
to VEGF isoforms and preventing their interaction 

30 with receptors. 
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10 



15 



20 



ORIGINAL RESIDUE 


EXEMPLARY 


SUBSTITUTIONS 


ALA 


SER, 


THR 




ARC 


LYS 






ASN 


HIS, 


SER 




ASP 


GLU, 


ASN 




CYS 


SER 






GLN 


ASN, 


HIS 




GLU 


ASP, 


GLU 




GLY 


ALA, 


SER 




HIS 


ASN, 


GLN 




ILE 


LEU, 


VAL, 


THR 


LEU 


ILE, 


VAL 




LYS 


ARG, 


GLN, 


GLU, THR 


MET 


LEU, 


ILE, 


VAL 


PHE 


LEU, 


TYR 




SER 


THR, 


ALA, 


ASN 


THR 


SER, 


ALA 




TRP 


ARG, 


SER 




TYR 


PHE 






VAL 


ILE, 


LEU ALA 


PRO 


ALA 
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SEQUENCE LISTING 



Sequence ID No 1 



corresponds to the amino acid 
sequence from position 23 to 345 
of the amino acid sequence 
illustrated in Figure 10. 



10 



15 



20 



Sequence ID No, 2 is the amino acid sequence 

illustrated in Figure 10. 



Sequence ID No 3 



Sequence ID No 4 



Sequence ID No 5 



corresponds to the sequence from 
position 257 to 1291 of the 
nucleotide sequence illustrated 
in Figure 9. 

corresponds to the polynucleotide 
sequence of VEGFX1 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX2 illustrated in 
Figure 3, 



25 



30 



35 



Sequence ID No 6 



Sequence ID No 7 



Sequence ID No 8 



corresponds to the polynucleotide 
sequence of VEGFX3 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX4 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX5 illustrated in 
Figure 3. 



Sequence ID No 9 corresponds to the polynucleotide 

sequence of VEGFX6 illustrated in 
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Sequence ID No 10 

5 

Sequence ID No 11 

10 

Sequence ID No 12 



15 Sequence ID No 13 



Sequence ID No 14 

20 



Sequence ID No 15 

25 

Sequence ID No 16 



30 

Sequence ID No 17 



35 Sequence ID No 18 



Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX7 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX8 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX9 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX10 illustrated 
in Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFXll illustrated 
in Figure 4. 

corresponds to the polynucleotide 
sequence of VEGFX12 illustrated 
in Figure 4 . 

corresponds to the polynucleotide 
sequence of VEGFX13 illustrated 
in Figure 4 . 

corresponds to the polynucleotide 
sequence of VEGFX14 illustrated 
in Figure 4. 

corresponds to the polynucleotide 
sequence 5 ' -1 in Figure 8. 
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Sequence ID No 19 
Sequence ID No 20 

5 

Sequence ID No 21 

10 

Sequence ID No 22 

15 

Sequence ID No 23 
20 Sequence ID No 24 
Sequence ID No 25 

25 

Sequence ID No 26 

30 

Sequence ID No 27 

35 

Sequence ID No 28 
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corresponds to the polynucleotide 
sequence 5 ! -2 in Figure 8. 

corresponds to the polynucleotide 
sequence of VEGFX6 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX7 illustrated in 
Figure 13 • 

corresponds to the polynucleotide 
sequence of VEGFX8 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX9 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGBAC1 illustrated 
in Figure 13. 

corresponds to the polynucleotide 
sequence of VEGBAC2 illustrated 
in Figure 13. 

corresponds to a polypeptide 
having the amino acid sequence 
from amino acid position 40 to 
150 of the sequence of Figure 10. 

corresponds to a polypeptide 
having the amino acid sequence 
illustrated in Figure 26. 

corresponds to the sequence from 
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position 5 to 508 of the 
nucleotide sequence illustrated 
in Figure 26. 

corresponds to the nucleotide 
sequence from position 5 to 508 
of the nucleotide sequence 
illustrated in Figure 26. 

•corresponds to the sequence from 
position .214 to 345 of the 
nucleotide sequence illustrated 
in Figure 10. 



CLAIMS 



1. A nucleic acid molecule encoding a VEGF-X 
protein or a functional equivalent, derivative or 
bioprecursor thereof, said protein comprising any of 
the sequences from position 23 to 345 of the amino 
acid sequence illustrated in Figure 10, or the 
complete sequence as illustrated in Figure 10. 

2. A nucleic acid molecule according to claim 1 
wherein said nucleic acid is a DNA molecule. 

3. A nucleic acid molecule according to claim 1 or 
2 wherein said nucleic acid is a cDNA molecule. 

4. A nucleic acid molecule according to claim 3 
comprising the nucleotide sequence from position 257 
to 1291 of the nucleotide sequence illustrated in 
Figure 9, or sequences that hybridise thereto under 
high stringency conditions or the complement thereto. 

5. An antisense molecule capable of hybridising to 
a molecule according to any of claims 1 to 4 under 
high stringency conditions. 

6. A nucleic acid molecule according to any of 
claims 1 to 4 which is of mammalian origin. 

7. A nucleic acid molecule according to claim 6 
which is of human origin. 

8. An isolated VEGF-X protein, or a functional 
equivalent, derivative or bioprecursor thereof, 
having an amino acid sequence from position 23 to 345 
of the amino acid sequence illustrated in Figure 10 
or the complete amino acid sequence of Figure 10. 
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9. A VEGF-X protein, or a functional equivalent, 
derivative or bioprecusor thereof, encoded by a 
nucleic acid molecule as defined in any of claims 1 
to 4. 

5 

10. A protein according to claim 9, which comprises 
the amino acid sequence illustrated in Figure 10. 

XI. An expression vector comprising a nucleic acid 
10 molecule according to any of claims 1 to 4. 

12. An expression vector according to claim 11 
further comprising a nucleotide sequence encoding a 
reporter molecule* 

15 

13 • An expression vector comprising an antisense 
molecule according to claim 5. 

14. A nucleic acid molecule according to any of 

20 claims 1 to 4 or an antisense molecule according to 
claim 5 for use as a medicament. 

15. A host cell transformed or transfected with an 
expression vector according to claim 11 or 12. 

25 

16. A host cell transformed or transfected with an 
expression vector according to claim 13. 

17. A transgenic cell, tissue or organism comprising 
30 a transgene capable of expressing a VEGF-X protein 

according to claim 8 or 9. 

18. A transgenic cell, tissue or organism according 
to claim 17, wherein said transgene is included in an 

35 expression vector. 



19. A VEGF-X protein or a functional equivalent, 
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derivative or bioprecursor thereof, expressed by a 
cell according to claim 15. 

20. A VEGF-X protein, or a functional equivalent, 
derivative or bioprecursor thereof, expressed by a 
transgenic cell, tissue or organism according to 
claim 17. 

21. A process for producing a VEGF-X protein 
according to any of claims 8 to 10, said process 
comprising transforming a host cell or organism with 
an expression vector according to claim il, and 
recovering the expressed protein from said host cell 
or organism. 

22. An antibody capable of binding to a protein 
according to any of claims 8 to 10, or an epitope 
thereof. 

23 * An antibody according to claim 22 for use as a 
medicament. 

24. A pharmaceutical composition comprising an 
antibody according to claim 22 together with a 
pharmaceutical^ acceptable carrier diluent or 
excipient thereof. 

25. A method of identifying VEGF-X protein in a 
sample which method comprises contacting said sample 
with an antibody according to claim 22 and monitoring 
for binding of any protein to said antibody. 

26. A kit for identifying the presence of VEGF-X 
protein in a sample which comprises an antibody 
according to claim 22 and means for contacting said 
antibody with said sample. 
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27. A method of identifying compounds which modulate 
angiogenesis which method comprises providing a host 
cell or organism according to claim 15 or a 
transgenic cell, tissue or organism according to 
claim 17, contacting a test compound with said cell, 
tissue or organism and monitoring for an effect of 
said compound on said VEGF compared to a host cell or 
organism according to claim 15 or a transgenic cell 
tissue or organism according to claim 17 which has 
not been contacted with said compound. 

28. A compound identifiable according to the method 
of claim 27. 

29. A compound according to claim 28 for use as a 
medicament, 

30. A nucleic acid sequence comprising the 
nucleotide sequences illustrated in any of Figures 3, 
5, 8 or 13. 

31. A method for producing a polypeptide, said 
method comprising the steps of: 

a) culturing the host cell of claim 15 under 
conditions suitable for expression of the 
polypeptide; and 

b) recovering the polypeptide from the host 
cell culture. 

32. A method of inhibiting angiogenic activity and 
inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ 
and tissue repair in a subject said method comprising 
administering to said subject an amount of an 
antisense molecule according to claim 5 in sufficient 



WO 00/37641 



PCT/US99/30503 



- 60 - 

concentration to reduce or prevent said angiogenic 
activity. 

33. A method of inhibiting angiogenic activity or 
5 inappropriate vascularisation including any of 

formation and proliferation of new blood vessels, 
growth and development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 

10 an amount of an antibody according to claim 22 in 
sufficient concentration to reduce or prevent said 

. . . angiogenic activity or inappropriate vascularisation. 

34. A method of inhibiting angiogenic activity or 
15 inappropriate vascularisation including any of 

formation and proliferation of new blood vessels, 
growth and development of tissues, tissue 
regeneration and organ and tissue repair in a 
subject, said method comprising implanting in said 
20 subject cells that express an antibody according to 
claim 22. 

35. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 

25 retinopathy, said method comprising administering to 
said subject an amount of an antisense molecule 
according to claim 5 in sufficient concentration to 
treat or prevent said disorders. 

30 36. A method of treating or preventing any of 

cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of an antibody according to 
claim 22 in sufficient concentration to reduce or 

35 prevent said disorders. 

37. A method of promoting angiogenic activity or 
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vascularisation to promote wound healing, skin graft 
growth, tissue repair, proliferation of new blood 
vessels, tissue regeneration and organ repair which 
method comprises applying or delivering to a site of 
interest a therapeutically effective amount of any of 
a group selected from a protein according to claim 8 
and a nucleic acid molecule encoding a VEGF-X protein 
or a functional equivalent, derivative or 
bioprecursor thereof comprising an amino acid 
sequence illustrated in Figure 10, an expression 
vector comprising said nucleic acid molecule and a 
pharmaceutical composition comprising any of said 
nucleic acid molecule and said protein. 

15 38. A method of treating wounds selected from the 
group consisting of dermal ulcers, pressure sores, 
venous sores, diabetic ulcers and burns by applying 
to said wound a therapeutically effective amount of 
any of a VEGF-X protein according to claim 8, a 

20 pharmaceutical composition comprising said protein 

and a pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. 

39. A nucleic acid molecule encoding a polypeptide 
25 having a CUB domain said polypeptide comprising the 
amino acid sequence from position 40 to 150 of the 
sequence of Figure 10. 



30 



35 



40. A nucleic acid molecule encoding a polypeptide 
having a CUB domain, said polypeptide comprising the 
amino acid sequence of Figure 26. 

41. A nucleic acid molecule according to claim 39 or 
40, comprising the nucleotide sequence from position 
5 to 508 of the sequence illustrated in Figure 26. 

42. A nucleic acid molecule according to any of 
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claims 39 to 41 comprising the nucleotide sequence 
illustrated in Figure 26. 

43. A nucleic acid molecule encoding a VEGF like 
domain comprising the sequence from position 214-345 
of the sequence of Figure 10 or the sequence from 
position 15 to 461 illustrated in Figure 24. 

44. An expression vector comprising a nucleic acid 
molecule according to any of claims 39 to 42. 

45. An expression vector comprising a nucleic acid 
molecule according to claim 43, 

46. A host cell transformed or transfected with an 
expression vector according to claim 44. 

47. A host cell transformed or transfected with an 
expression vector according to claim 45. 

48. A protein expressed by the cell according to 
claim 46. 

49. A protein expressed by the cell according to 
claim 47. 

50. A method of identifying compounds that inhibit 
or enhance angiogenic activity, said method 
comprising contacting a cell expressing a VEGF 
receptor and/or a neuropilin 1 or 2 type receptor 
with said compound in the presence of a VEGF-X 
protein according to claim 8 and monitoring for the 
effect of said compound or said cell when compared to 
a cell which has not been contacted with said 
compound. 

51. A compound identifiable according to the method 
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of claim 50 as an inhibitor or enhancer of angiogenic 
activity. 

52. A method of inhibiting angiogenic activity or 
inappropriate vascularisation, said method comprising 
contacting a cell expressing, a VEGF receptor and a 
neuropilin type receptor with a protein selected from 
any of a protein according to any of claims 8 to 10 
and a protein according to claim 48 or a protein 
according to claim 49. 

53. Use of a nucleotide sequence illustrated in any 
of Figures 14 and 15 in identifying a VEGF-X protein 
according to claim 8. 

54. A nucleic acid molecule encoding a polypeptide 
comprising a CUB domain having the sequence from 
position 4 0 to 150 of the sequence of Figure 10 or 
from position 5 to 508 of the sequence of Figure 26 
and a sequence encoding a VEGF domain, 

55. A nucleic acid molecule according to claim 54 
wherein said sequence encoding said VEGF domain is 
selected from the sequences encoding any of VEGF A to 
D or isoforms or variants thereof. 

56. A nucleic acid molecule encoding a polypeptide 
comprising the amino acid sequence from position 40 
to 150 of the sequence illustrated in Figure 10 for 
use as a medicament. 

57. Use of a nucleic acid molecule encoding a 
polypeptide having the amino acid sequence from 
position 40 to 150 of the sequence illustrated in 
Figure 10 in the manufacture of a medicament for 
treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour or cancer 
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growth, retinopathy, osteoarthritis or psoriasis, 

58. A polypeptide comprising the amino acid sequence 
from position 40 to 150 of the sequence illustrated 
in figure 10 for use as a medicament. 

59. A polypeptide comprising the amino acid sequence 
from position 40 to 150 of the sequence illustrated 
in Figure 10 in the manufacture of a medicament for 
the treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour growth, 
retinopathy, osteoarthritis or psoriasis. 

60. Use of a CUB domain comprising the amino acid 
sequence from position 40 to 150 of the sequence of 
Figure 10, or the amino acid sequence of Figure 26, 
to identify compounds which inhibit angiogenic 
activity in a method according to claim 50. 

61. A method of inhibiting angiogenic activity and 
inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ 
and tissue repair in a subject said method comprising 
administering to said subject an amount of a 
polypeptide having an amino acid sequence from 
position 40 to 150 of the sequence illustrated in 
Figure 10 or a nucleic acid molecule according to any 
of claims 39 to 42 in sufficient concentration to 
reduce or prevent said angiogenic activity. 

62. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of a polypeptide having an 
amino acid sequence from position 40 to 150 of the 
sequence illustrated in Figure 10 or a nucleic acid 
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molecule according to any of claims 39 to 42 in 
sufficient concentration to treat or prevent said 
disorders. 

5 63. An antisense molecule capable of hybridising to 
a molecule according to any of claims 39 to 42 under 
high stringency conditions. 

64. An antisense molecule capable of hybridising to 
10 a molecule according to claim 43 under high. 

stringency conditions. 

65. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a protein according 

15 to claim 48. 

66. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a protein according 
to claim 49. 

20 

67. A transgenic, cell tissue or organism 
according to claim 65 or 66, wherein said transgene 
is included in an expression vector according to 
claim 41 or 42. 

25 

68. An antibody capable of binding to a protein 
according to claim 48 or an epitope thereof. 

69. An antibody capable of binding to a protein 
30 according to claim 4 9 or an epitope thereof. 

70. A pharmaceutical composition comprising an 
antibody according to claim 68 or 69 together with a 
pharmaceutically acceptable carrier diluent or 

35 excipient therefor. 

71. A pharmaceutical composition comprising a 



WO 00/37641 



PCTAJS99/30503 



- 66 - 

compound according to claim 48 together with a 
pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. 

72. A nucleic acid molecule encoding a variant of a 
VEGF-X protein having any of the sequences of 
nucleotides illustrated in Figure 12. 
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1 AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG GCTTGAAGAC CCAGAAGATG 
TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC CGAACTTCTG GGTCTTCTAC 

61 ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG TGATGGAACT ATATTAGGGC 
TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC ACTACCTTGA TATAATCCCG 

121 GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC TAAAGGAAAT CAAATTAGGA 
CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG ATTTCCTTTA GTTTAATCCT 

+1 MetAsn IlePheLeu LeuAsnLeuLeu ThrGluGlu ValArgLeu 

]„_" 

181 TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCTTC TAACAGAGGA GGTAAGATTA 
ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGAAG ATTGTCTCCT CCATTCTAAT 

+1 TyrSerCysThr ProArgAsn PheSerVal SerlleArgGlu GluLeuLys ArgThrAsp 



2 41 TACAGCTGCA CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT 
ATGTCGACGT GTGGAGCATT GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA 

+1 ThrllePheTrp ProGlyCys LeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys 



3 01 ACCATTTTCT GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT 
TGGTAAAAGA CCGGTCCAAC AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA 

+1 LeuHisAsnCys AsnGluCys GlnCysVal ProSerLysVal ThrLysLys TyrHisGlu 



3 61 CTCCACAATT GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG 
GAGGTGTTAA CGTTACTTAC AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC 

+1 ValLeuGlnLeu ArgProLys ThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal 



421 GTCCTTCAGT TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG 
CAGGAAGTCA ACTCTGGTTT CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC 

+1 AlaLeuGluHis HisGluGlu CysAspCys ValCysArgGly SerThrGly Gly 



GTGTGCAGAG GGAGCACAGG AGGATAGCCG 
CACACGTCTC CCTCGTGTCC TCCTATCGGC 

GTGCAGTGCA GTGGCTGATT CTATTAGAGA 
CACGTCACGT CACCGACTAA GATAATCTCT 

GTTGTTTGCT TCAAGGACCT TTCATCTTCA 
CAACAAACGA AGTTCCTGGA AAGTAGAAGT 

ATCAAACAGA ATTAGGAGTT GTGCAACAGC 
TAGTTTGTCT TAATCCTCAA CACGTTGTCG 

AAAAGGTCTT CAATCGTGGA AAGAAAATTA 
TTTTCCAGAA GTTAGCACCT TTCTTTTAAT 

TTCAGAGTTA CCATGTACGT ATTCCACTAG 
AAGTCTCAAT GGTACATGCA TAAGGTGATC 
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481 GCCCTGGAGC ACCATGAGGA GTGTGACTGT 
CGGGACCTCG TGGTACTCCT CACACTGACA 

541 CATCACCACC AGCAGCTCTT GCCCAGAGCT 
GTAGTGGTGG TCGTCGAGAA CGGGTCTCGA 

601 ACGTATGCGT TATCTCCATC CTTAATCTCA 
TGCATACGCA ATAGAGGTAG GAATTAGAGT 

661 GGATTTACAG TGCATTCTGA AAGAGGAGAC 
CCTAAATGTC ACGTAAGACT TTCTCCTCTG 

721 TCTTTTGAGA GGAGGCCTAA AGGACAGGAG 
AGAAAACTCT CCTCCGGATT TCCTGTCCTC 

781 AATGTTGTAT TAAATAGATC ACCAGCTAGT 
TTACAACATA ATTTATCTAG TGGTCGATCA 
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841 CTGGGTTCTG TATTTCAGTT CTTTCGATAC 
GACCCAAGAC ATAAAGTCAA GAAAGCTATG 

901 ACTGTGCAAG TGAGCACCTG ATTCCGTTGC 
TGACACGTTC ACTCGTGGAC TAAGGCAACG 

961 GCCTAAAATC GTATAAAATC TGGATTTTTT 
CGGATTTTAG CATATTTTAG ACCTAAAAAA 

1021 AACCAGAACA TTCTATGTAC TACAAACCTG 
TTGGTCTTGT AAGATACATG ATGTTTGGAC 

1081 TTAAACTTGT GTCGTGCTGA TAGGACAGAC 
AATTTGAACA CAGCACGACT ATCCTGTCTG 

1141 TGCCATTTAG AAGAAGAGAA CTACATTCAT 
ACGGTAAATC TTCTTCTCTT GATGTAAGTA 

1201 GTGGCCTTAT CTTCACTTTA TCGATAAGTC 
CACCGGAATA GAAGTGAAAT AGCTATTCAG 

1261 TATTCTCCTT TTGACATTAT AACTGTTGGC 
ATAAGAGGAA AACTGTAATA TTGACAACCG 

1321 TACCAAAGGT ATTTAATATT CTTTTTTATG 
ATGGTTTCCA TAAATTATAA GAAAAAATAC 

13 81 AAATTTTTCT AAACACAATT GTTATAGCCA 
TTTAAAAAGA TTTGTGTTAA CAATATCGGT 

1441 CTCTGACAAA AATACATGTA TTTCATTCTC 
GAGACTGTTT TTATGTACAT AAAGTAAGAG 

1501 TTTAAAAAAC TGAATTGGAA TAGAATTGGT 
AAATTTTTTG ACTTAACCTT ATCTTAACCA 

1561 ATTATCATAT CTTCCATTCC TGTTATTGGA 
TAATAGTATA GAAGGTAAGG ACAATAACCT 

1621 GACATTCAGA TCCAGCCATT ACTAACCTAT 
CTGTAAGTCT AGGTCGGTAA TGATTGGATA 

1681 AGAAAAACAT AAAGCACCTT GAAAAAGACT 
TCTTTTTGTA TTTCGTGGAA CTTTTTCTGA 

1741 ' GTGCAGTAGG AACACATCCT ATTTATTGTG 
CACGTCATCC TTGTGTAGGA TAAATAACAC 

1801 TCCATACACT TGTATAAATA CATGGATATT 
AGGTATGTGA ACATATTTAT GTACCTATAA 

1861 TTCACTTATT GTACCTGG 
AAGTGAATAA CATGGACC 



GGCTTAGGGT AATGTCAGTA CAGGAAAAAA 
CCGAATCCCA TTACAGTCAT GTCCTTTTTT 

CTTGCTTAAC TCTAAAGCTC CATGTCCTGG 
GAACGAATTG AGATTTCGAG GTACAGGACC 

TTTTTTTTTT TGCTCATATT CACATATGTA 
AAAAAAAAAA ACGAGTATAA GTGTATACAT 

GTTTTTAAAA AGGAACTATG TTGCTATGAA 
CAAAAATTTT TCCTTGATAC AACGATACTT 

TGGATTTTTC ATATTTCTTA TTAAAATTTC 
ACCTAAAAAG TATAAAGAAT AATTTTAAAG 

GGTTTGGAAG AGATAAACCT GAAAAGAAGA 
CCAAACCTTC TCTATTTGGA CTTTTCTTCT 

AGTTTATTTG TTTCATTGTG TACATTTTTA 
TCAAATAAAC AAAGTAACAC ATGTAAAAAT 

TTTTCTAATC TTGTTAAATA TATCTATTTT 
AAAAGATTAG AACAATTTAT ATAGATAAAA 

ACAACTTAGA TCAACTATTT TTAGCTTGGT 
TGTTGAATCT AGTTGATAAA AATCGAACCA 

GAGGAACAAA GATGATATAA AATATTGTTG 
CTCCTTGTTT CTACTATATT TTATAACAAC 

GTATGGTGCT AGAGTTAGAT TAATCTGCAT 
CATACCACGA TCTCAATCTA ATTAGACGTA 

AAGTTGCAAA GACTTTTTGA AAATAATTAA 
TTCAACGTTT CTGAAAAACT TTTATTAATT 

GATGAAAATA AAAAGCAACT TATGAAAGTA 
CTACTTTTAT TTTTCGTTGA ATACTTTCAT 

TCCTTTTTTG GGGAAATCTG AGCCTAGCTC 
AGGAAAAAAC CCCTTTAGAC TCGGATCGAG 

TGGCAGCTTC CTGATAAAGC GTGCTGTGCT 
ACCGTCGAAG GACTATTTCG CACGACACGA 

ATGTTGTGGT TTTATTATCT TAAACTCTGT 
TACAACACCA AAATAATAGA ATTTGAGACA 

TTTATGTACA GAAGTATGTC TCTTAACCAG 
AAATACATGT CTTCATACAG AGAATTGGTC 
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Predicted VEGF-like protein encoded by Incyte contig of 8/12/98 

1 MNIFLLNLLT EEVRLYSCTP RNFSVSIRHE LKSTDTIFWP GCLLVKRCGG 
51 NCACCLHNCN ECQCVPSKVT KKVHTVLQLR PKTGVRGLHX SLTDVALEHH 
101 ESCOCVCRGS TGG 



^/2tT vj? pcr primers for cloning VEGF-X 

vegfXl AAAATGTATGGATACAACTTAC 

vegfX2 GTTTGATGAAAQATTTGGGCTTG 

vegfX3 TTTCTAAAGGAAATCAAATTAG 

vegfX4 GATAAGATTTGTATCTGATG 

vegfX5 GATGTCTCCTCTTTCAG 

vegfX6 GCACAACTCCTAATTCTG 

vegfX7 AGCACCTGATTCCGTTGC 

vegfX8 TAGTACATAGAATGTTCTGG 

vegfX9 AAGAGACATACTTCTGTAC 

vegfXlO CCAGGTACAATAAGTGAACTG 
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Variants isolated by PCR (at 8/2/99, all cloned and sequenced at 

JRF) 



a b c d e f 



PCR primers- 



clone 22, 29, 41 C 



clone 1 5, 20 r 



clones 57, 25, 
26, 27 



2. 1kb clones 1, 
2,3 



Incytecontig I i ^^^r 
(8/12/98) 



clone 52, 59 I T ^*^ 



primers- a-vegfX1 b- vegfX2 c- vegfX5 

(see fig 3) d- vsgfXS e-vegfX9 f-vegfX10 
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/^tP.yf. VEGF-X 5' RACE primers 

vegfXl 1 CCTTTAGAAATCTGTTTTCCTGGTACAG 
vegfXl2 GGAAAATATTCATCAGATACAAATCTTATCC 
vegfX13 GGTCCAGTGGCAAAGCTGAAGG 
vegfXU CTGGTTCAAGATATCGAATAAGGTCTTCC 
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DNA sequence assembled from in-house clones and 5'RACE 



1 TGCCAGAGCA GGTGGGCGCT TCCACCCCAG TGCAGCCTTC CCCTGGCGGT GGTGAAAGAG 
ACGGTCTCGT CCACCCGCGA AGGTGGGGTC ACGTCGGAAG GGGACCGCCA CCACTTTCTC 

61 ACTCGGGAGT CGCTGCTTCC AAAGTGCCCG CCGTGAGTGA GCTCTCACCC CAGTCAGCCA 
TGAGCCCTCA GCGACGAAGG TTTCACGGGC GGCACTCACT CGAGAGTGGG GTCAGTCGGT 

+2 MetSerLeu PheGlyLeuLeu LeuLeuThr SerAlaLeu AlaGlyGlnArg GlnGlyTh 
] 

121 AATGAGCCTC TTCGGGCTTC TCCTGCTGAC ATCTGCCCTG GCCGGCCAGA GACAGGGGAC 
TTACTCGGAG AAGCCCGAAG AGGACGACTG TAGACGGGAC CGGCCGGTCT CTGTCCCCTG 

+2 rGlnAlaGlu SerAsnLeuSer SerLysPhe GlnPheSer SerAsnLysGlu GlnAsnGl 



181 TCAGGCGGAA TCCAACCTGA GTAGTAAATT CCAGTTTTCC AGCAACAAGG AACAGAACGG 
AGTCCGCCTT AGGTTGGACT CATCATTTAA GGTCAAAAGG TCGTTGTTCC TTGTCTTGCC 

+2 yValGlnAsp ProGlnHisGlu Argllelle ThrValSer ThrAsnGlySer IleHisSe 



241 AGTACAAGAT CCTCAGCATG AGAGAATTAT TACTGTGTCT ACTAATGGAA GTATTCACAG 
TCATGTTCTA GGAGTCGTAC TCTCTTAATA ATGACACAGA TGATTACCTT CATAAGTGTC 

+2 rProArgPhe ProHisThrTyr ProArgAsn ThrValLeu ValTrpArgLeu ValAlaVa 



301 CCCAAGGTTT CCTCATACTT ATCCAAGAAA TACGGTCTTG GTATGGAGAT TAGTAGCAGT 
GGGTTCCAAA GGAGTATGAA TAGGTTCTTT ATGCCAGAAC CATACCTCTA ATCATCGTCA 

+2 IGluGluAsn ValTrpIleGln LeuThrPhe AspGluArg PheGlyLeuGlu AspProGl 



3 61 AGAGGAAAAT GTATGGATAC AACTTACGTT TGATGAAAGA TTTGGGCTTG AAGACCCAGA 
TCTCCTTTTA CATACCTATG TTGAATGCAA ACTACTTTCT AAACCCGAAC TTCTGGGTCT 

+2 uAspAspIle CysLysTyrAsp PheValGlu ValGluGlu ProSerAspGly ThrlleLe 



421 AGATGACATA TGCAAGTATG ATTTTGTAGA AGTTGAGGAA CCCAGTGATG GAACTATATT 
TCTACTGTAT ACGTTCATAC TAAAACATCT TCAACTCCTT GGGTCACTAC CTTGATATAA 

+ 2 uGlyArgTrp CysGlySerGly ThrValPro GlyLysGln IleSerLysGly AsnGlnll 



4 81 AGGGCGCTGG TGTGGTTCTG GTACTGTACC AGGAAAACAG ATTTCTAAAG GAAATCAAAT 
TCCCGCGACC ACACCAAGAC CATGACATGG TCCTTTTGTC TAAAGATTTC CTTTAGTTTA 

+2 eArglleArg PheValSerAsp GluTyrPhe ProSerGlu ProGlyPheCys IleHisTy 



541 TAGGATAAGA TTTGTATCTG ATGAATATTT TCCTTCTGAA CCAGGGTTCT GCATCCACTA 
ATCCTATTCT AAACATAGAC TACTTATAAA AGGAAGACTT GGTCCCAAGA CGTAGGTGAT 

+2 rAsnlleVal MetProGlnPhe ThrGluAla ValSerPro SerValLeuPro ProSerAl 



601 CAACATTGTC ATGCCACAAT TCACAGAAGC TGTGAGTCCT TCAGTGCTAC CCCCTTCAGC 
GTTGTAACAG TACGGTGTTA AGTGTCTTCG ACACTCAGGA AGTCACGATG GGGGAAGTCG 

+2 aLeuProLeu AspLeuLeuAsn -AsnAlalle ThrAlaPhe SerThrLeuGlu AspLeuIl 



661 TTTGCCACTG GACCTGCTTA ATAATGCTAT AACTGCCTTT AGTACCTTGG AAGACCTTAT 
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+2 eArgTyrLeu GluProGluArg TrpGlnLeu AspLeuGlu AspLeuTyrArg ProThrTr 

721 TCGATATCTT GAACCAGAGA GATGGCAGTT GGACTTAGAA GATCTATATA GGCCAACTTG 
AGCTATAGAA CTTGGTCTCT CTACCGTCAA CCTGAATCTT CTAGATATAT CCGGTTGAAC 

+2 pGlnLeuLeu GlyLysAlaPhe ValPheGly ArgLysSer ArgValValAsp LeuAsnLe 

781 GCAACTTCTT GGCAAGGCTT TTGTTTTTGG AAGAAAATCC AGAGTGGTGG ATCTGAACCT 
CGTTGAAGAA CCGTTCCGAA AACAAAAACC TTCTTTTAGG TCTCACCACC TAGACTTGGA 

+2 uLeuThrGlu GluValArgLeu TyrSerCys ThrProArg AsnPheSerVal SerlleAr 

841 TC T AAC AG AG GAGGTAAGAT TATACAGCTG CACACCTCGT AACTTCTCAG TGTCCATAAG 
AGATTGTCTC CTCCATTCTA ATATGTCGAC GTGTGGAGCA TTGAAGAGTC ACAGGTATTC 

+2 gGluGluLeu LysArgThrAsp ThrllePhe TrpProGly CysLeuLeuVal LysArgCy 

901 GGAAGAACTA AAGAGAACCG ATACCATTTT CTGGCCAGGT TGTCTCCTGG TTAAACGCTG 
CCTTCTTGAT TTCTCTTGGC TATGGTAAAA GACCGGTCCA ACAGAGGACC AATTTGCGAC 

+2 sGlyGlyAsn CysAlaCysCys LeuHisAsn CysAsnGlu CysGlnCysVal ProSerLy 

961 TGGTGGGAAC TGTGCCTGTT GTCTCCACAA TTGCAATGAA TGTCAATGTG TCCCAAGCAA 
ACCACCCTTG ACACGGACAA CAGAGGTGTT AACGTTACTT AC AG TT AC AC AGGGTTCGTT 

+2 sValThrLys LysTyrHisGlu ValLeuGln LeuArgPro LysThrGlyVal ArgGlyLe 

1021 AGTTACTAAA AAATACCACG AGGTCCTTCA GTTGAGACCA AAGACCGGTG TCAGGGGATT 
TCAATGATTT TTTATGGTGC TCCAGGAAGT CAACTCTGGT TTCTGGCCAC AGTCCCCTAA 

+2 uHisLysSer LeuThrAspVal AlaLeuGlu HisHisGlu GluCysAspCys ValCysAr 

1081 GCACAAATCA CTCACCGACG TGGCCCTGGA GCACCATGAG GAGTGTGACT GTGTGTGCAG 
CGTGTTTAGT GAGTGGCTGC ACCGGGACCT CGTGGTACTC CTCACACTGA CACACACGTC 

+2 gGlySerThr GlyGly 
> 

1141 AGGGAGCACA GGAGGATAGC CGCATCACCA CCAGCAGCTC TTGCCCAGAG CTGTGCAGTG 
TCCCTCGTGT CCTCCTATCG GCGTAGTGGT GGTCGTCGAG AACGGGTCTC GACACGTCAC 

1201 CAGTGGCTGA TTCTATTAGA GAACGTATGC GTTATCTCCA TCCTTAATCT CAGTTGTTTG 
GTCACCGACT AAGATAATCT CTTGCATACG CAATAGAGGT AGGAATTAGA GTCAACAAAC 

1261 CTTCAAGGAC CTTTCATCTT CAGGATTTAC AGTGCATTCT GAAAGAGGAG ACATCAAACA 
GAAGTTCCTG GAAAGTAGAA GTCCTAAATG TCACGTAAGA CTTTCTCCTC TGTAGTTTGT 

1321 GAATTAGGAG TTGTGCAACA GCTCTTTTGA GAGGAGGCCT AAAGGACAGG AGAAAAGGTC 
CTTAATCCTC AACACGTTGT CGAGAAAACT CTCCTCCGGA TTTCCTGTCC TCTTTTCCAG 

1381 TTCAATCGTG GAAAGAAAAT TAAATGTTGT ATTAAATAGA TCACCAGCTA GTTTCAGAGT 
AAGTTAGCAC CTTTCTTTTA ATTTACAACA TAATTTATCT AGTGGTCGAT CAAAGTCTCA 

1441 TACCATGTAC GTATTCCACT AGCTGGGTTC TGTATTTCAG TTCTTTCGAT ACGGCTTAGG 
ATGGTACATG CATAAGGTGA TCGACCCAAG ACATAAAGTC AAGAAAGCTA TGCCGAATCC 

1501 GTAATGTCAG TACAGGAAAA AAACTGTGCA AGTGAGCACC TGATTCCGTT GCCTTGCTTA 

*" ~ — — — — — — — ~~ ~' — • — - - — — - — — — — — — — — — - — » •* — — - — « « m 
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1561 ACTCTAAAGC TCCATGTCCT GGGCCTAAAA TCGTATAAAA TCTGGATTTT TTTTTTTTTT 
TGAGATTTCG AGGTACAGGA CCCGGATTTT AGCATATTTT AGACCTAAAA AAAAAAAAAA 

1621 TTTGCTCATA TTCACATATG TAAACCAGAA CATTCTATGT ACTACAAACC TGGTTTTTAA 
AAACGAGTAT AAGTGTATAC ATTTGGTCTT GTAAGATACA TGATGTTTGG ACCAAAAATT 

1681 AAAGGAACTA TGTTGCTATG AATTAAACTT GTGTCGTGCT GATAGGACAG ACTGGATTTT 
TTTCCTTGAT ACAACGATAC TTAATTTGAA CACAGCACGA CTATCCTGTC TGACCTAAAA 

1741 TCATATTTCT TATTAAAATT TCTGCCATTT AGAAGAAGAG AACTACATTC ATGGTTTGGA 
AGTATAAAGA ATAATTTTAA AGACGGTAAA TCTTCTTCTC TTGATGTAAG TACCAAACCT 

18 01 AGAGATAAAC CTGAAAAGAA GAGTGGCCTT ATCTTCACTT TATCGATAAG CCAGTTTATT 
TCTCTATTTG GACTTTTCTT CTCACCGGAA TAGAAGTGAA ATAGCTATTC GGTCAAATAA 

1861 TGTTTCATTG TGTACATTTT TATATTCTCC TTTTGACATT ATAACTGTTG GCTTTTCTAA 
ACAAAGTAAC ACATGTAAAA ATATAAGAGG AAAACTGTAA TATTGACAAC CGAAAAGATT 

1921 TCTTGTTAAA TATATCTATT TTTACCAAAG GTATTTAATA TTCTTTTTTA TGACAACTTA 
AGAACAATTT ATATAGATAA AAATGGTTTC CATAAATTAT AAGAAAAAAT ACTGTTGAAT 

1981 GATCAACTAT TTTTAGCTTG GTAAATTTTT CTAAACACAA TTGTTATAGC CAGAGGAACA 
CTAGTTGATA AAAATCGAAC CATTTAAAAA GATTTGTGTT AACAATATCG GTCTCCTTGT 

2041 AAGATGATAT AAAATATTGT TGCTCTGACA AAAATACATG TATTTCATTC TCGTATGGTG 
TTCTACTATA TTTTATAACA ACGAGACTGT TTTTATGTAC ATAAAGTAAG AGCATACCAC 

2101 CTAGAGTTAG ATTAATCTGC ATTTTAAAAA ACTGAATTGG AATAGAATTG GTAAGTTGCA 
GATCTCAATC T AATT AG AC G TAAAATTTTT TGACTTAACC TTATCTTAAC CATTCAACGT 

2161 AAGACTTTTT GAAAATAATT AAATTATCAT ATCTTCCATT CCTGTTATTG GAGATGAAAA 
TTCTGAAAAA CTTTTATTAA TTTAATAGTA TAGAAGGTAA GGACAATAAC CTCTACTTTT 

2221 TAAAAAGCAA CTTATGAAAG TAGACATTCA GATCCAGCCA TTACTAACCT ATTCCTTTTT 
ATTTTTCGTT GAATACTTTC ATCTGTAAGT CTAGGTCGGT AATGATTGGA TAAGGAAAAA 

2281 TGGGGAAATC TGAGCCTAGC TCAGAAAAAC ATAAAGCACC TTGAAAAAGA CTTGGCAGCT 
ACCCCTTTAG ACTCGGATCG AGTCTTTTTG TATTTCGTGG AACTTTTTCT GAACCGTCGA 

2341 TCCTGATAAA GCGTGCTGTG CTGTGCAGTA GGAACACATC CTATTTATTG TGATGTTGTG 
AGGACTATTT CGCACGACAC GACACGTCAT CCTTGTGTAG GATAAATAAC ACTACAACAC 

2401 GTTTTATTAT CTTAAACTCT GTTCCATACA CTTGTATAAA TACATGGATA TTTTTATGTA 
CAAAATAATA GAATTTGAGA CAAGGTATGT GAACATATTT ATGTACCTAT AAAAATACAT 

2461 CAGAAGTATG TCTCT 
GTCTTCATAC AGAGA 
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Z 



New Sequence + Incyte ESTs 



i 



ATTTGTTTAA 



ACCTTGGGAA 



ACTGGTTCAG GTCCAGGTTT 



TGCTTTGATC CTTTTCAAAA 



TAAACAAATT TGGAACCCTT TGACCAAGTC CAGGTCCAAA ACGAAACTAG GAAAAGTTTT 

61 ACTGGAGACA CAGAAGAGGG CTTCTAGGAA AAAGTTTTGG GATGGGATTA TGTGGAAACT 
TGACCTCTGT GTCTTCTCCC GAAGATCCTT TTTCAAAACC CTACCCTAAT ACACCTTTGA 

121 ACCCTGCGAT TCTCTGCTGC CAGAGCAGGC TCGGCGCTTC CACCCCAGTG CAGCCTTCCC 
TGGGACGCTA AGAGACGACG GTCTCGTCCG AGCCGCGAAG GTGGGGTCAC GTCGGAAGGG 

181 CTGGCGGTGG TGAAAGAGAC TCGGGAGTCG CTGCTTCCAA AGTGCCCGCC GTGAGTGAGC 
GACCGCCACC ACTTTCTCTG AGCCCTCAGC GACGAAGGTT TCACGGGCGG CACTCACTCG 

+2 Met SerLeuPhe GlyLeuLeu LeuLeuThrSer AlaLeuAl 

j 

241 TCTCACCCCA GTCAGCCAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC 
AGAGTGGGGT CAGTCGGTTT ACTCGGAGAA GCCCGAAGAG GACGACTGTA GACGGGACCG 

+2 aGlyGlnArg GlnGlyThrGln AlaGluSer AsnLeuSer SerLysPheGln PheSerSe 



3 01 CGGCCAGAGA CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG 
GCCGGTCTCT GTCCCCTGAG TCCGCCTTAG GTTGGACTCA TCATTTAAGG TCAAAAGGTC 

+2 rAsnLysGlu GlnTyrGlyVal GlnAspPro GlnHisGlu ArgllelleThr ValSerTh 



3 61 CAACAAGGAA CAGTACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC 
GTTGTTCCTT GTCATGCCTC ATGTTCTAGG AGTCGTACTC TCTTAATAAT GACACAGATG 

+2 rAsnGlySer IleHisSerPro ArgPhePro HisThrTyr ProArgAsnThr ValLeuVa 



421 TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT 
ATTACCTTCA TAAGTGTCGG GTTCCAAAGG AGTATGAATA GGTTCTTTAT GCCAGAACCA 

+2 lTrpArgLeu ValAlaValGlu GluAsnVal TrpIleGln LeuThrPheAsp GluArgPh 



481 ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT 
TACCTCTAAT CATCGTCATC TCCTTTTACA TACCTATGTT GAATGCAAAC TACTTTCTAA 

+2 eGlyLeuGlu AspProGluAsp AspIleCys LysTyrAsp PheValGluVal GluGluPr 



541 TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC 
ACCCGAACTT CTGGGTCTTC TACTGTATAC GTTCATACTA AAACATCTTC AACTCCTTGG 

+2 oSeirAspGly ThrlleLeuGly ArgTrpCys GlySerGly ThrValProGly LysGlnll 



601 CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT 
GTCACTACCT TGATATAATC CCGCGACCAC ACCAAGACCA TGACATGGTC CTTTTGTCTA 

+2 eSerLysGly AsnGlnlleArg IleArgPhe ValSerAsp GluTyrPhePro SerGluPr 



6 61 TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC CTTCTGAACC 
AAGATTTCCT TTAGTTTAAT CCTATTCTAA ACATAGACTA CTTATAAAAG GAAGACTTGG 
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+2 oGlyPheCys IleHisTyrAsn IleValMet ProGlnPhe ThrGluAlaVal SerProSe 



721 AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC 
TCCCAAGACG TAGGTGATGT TGTAACAGTA CGGTGTTAAG TGTCTTCGAC ACTCAGGAAG 

+2 rValLeuPro ProSerAlaLeu ProLeuAsp LeuLeuAsn AsnAlalleThr AlaPheSe 



781 AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG 
TCACGATGGG GGAAGTCGAA ACGGTGACCT GGACGAATTA TTACGATATT GACGGAAATC 

+2 rThrLeuGlu AspLeuIleArg TyrLeuGlu ProGluArg TrpGlnLeuAsp LeuGluAs 



841 TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG ACTTAGAAGA 
ATGGAACCTT CTGGAATAAG CTATAGAACT TGGTCTCTCT ACCGTCAACC TGAATCTTCT 

+2 pLeuTyrArg ProThrTrpGln LeuLeuGly LysAlaPhe ValPheGlyArg LysSerAr 



9 01 TCTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG 
AGATATATCC GGTTGAACCG TTGAAGAACC GTTCCGAAAA CAAAAACCTT CTTTTAGGTC 

+2 gValValAsp LeuAsnLeuLeu ThrGluGlu ValArgLeu TyrSerCysThr ProArgAs 



961 AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA 
TCACCACCTA GACTTGGAAG ATTGTCTCCT CCATTCTAAT ATGTCGACGT GTGGAGCATT 

+2 nPheSerVal SerlleArgGlu GluLeuLys ArgThrAsp ThrllePheTrp ProGlyCy 



1021 CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG 
GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA TGGTAAAAGA CCGGTCCAAC 

+2 sLeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys LeuHisAsnCys AsnGluCy 



1081 TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATGAATG 
AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA GAGGTGTTAA CGTTACTTAC 

+2 sGlnCysVal ProSerLysVal ThrLysLys TyrHisGlu ValLeuGlnLeu ArgProLy 



1141 TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT TGAGACCAAA 
AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC CAGGAAGTCA ACTCTGGTTT 

+2 sThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal AlaLeuGluHis HisGluGl 



1201 GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC ACCATGAGGA 
CTGGCCACAG TCCCCTAACb TGTTTAGTGA GTGGCTGCAC CGGGACCTCG TGGTACTCCT 

+2 uCysAspCys ValCysArgGly SerThrGly Gly 
> 

1261 GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG CATCACCACC AGCAGCTCTT 
CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC GTAGTGGTGG TCGTCGAGAA 

1321 GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA ACGTATGCGT TATCTCCATC 
CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT TGCATACGCA ATAGAGGTAG 

13 81 CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA GGATTTACAG TGCATTCTGA 
GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT CCTAAATGTC ACGTAAGACT 
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1441 AAGAGGAGAC ATCAAACAGA ATTAGGAGTT 
TTCTCCTCTG TAGTTTGTCT TAATCCTCAA 

1501 AGGACAGGAG AAAAGGTCTT CAATCGTGGA 
TCCTGTCCTC TTTTCCAGAA GTTAGCACCT 

1561 ACCAGCTAGT TTCAGAGTTA CCATGTACGT 
TGGTCGATCA AAGTCTCAAT GGTACATGCA 

1621 CTTTCGATAC GGCTTAGGGT AATGTCAGTA 
GAAAGCTATG CCGAATCCCA TTACAGTCAT 

1681 ATTCCGTTGC CTTGGCTTAA CTCTAAAGCT 
TAAGGCAACG GAACCGAATT GAGATTTCGA 

1741 CTGGATTTTT TTTTTTTTTT TTGCGCATAT 
GACCTAAAAA AAAAAAAAAA AACGCGTATA 

1801 CTACAAACCT GGTTTTTAAA AAGGAACTAT 
GATGTTTGGA CCAAAAATTT TTCCTTGATA 

1861 ATAGGACAGA CTGGATTTTT CATATTTCTT 
TATCCTGTCT GACCTAAAAA GTATAAAGAA 

1921 ACTACATTCA TGGTTTGGAA GAGATAAACC 
TGATGTAAGT ACCAAACCTT CTCTATTTGG 

1981 ATCGATAAGT CAGTTTATTT GTTTCATTGT 
TAGCTATTCA GTCAAATAAA CAAAGTAACA 

2041 TAACTGTTGG CTTTTCTAAT CTTGTTAAAT 
ATTGACAACC GAAAAGATTA GAACAATTTA 

2101 TCTTTTTTAT GACAACTTAG ATCAACTATT 
AGAAAAAATA CTGTTGAATC TAGTTGATAA 

2161 TGTTATAGCC AGAGGAACAA AGATGATATA 
ACAATATCGG TCTCCTTGTT TCTACTATAT 

2221 ATTTCATTCT CGTATGGTGC TAGAGTTAGA 
TAAAGTAAGA GCATACCACG ATCTCAATCT 

2281 ATAGAATTGG TAAGTTGCAA AGACTTTTTG 
TATCTTAACC ATTCAACGTT TCTGAAAAAC 

2341 CTGTTATTGG AGATGAAAAT AAAAAGCAAC 
GACAATAACC TCTACTTTTA TTTTTCGTTG 

2401 TACTAACCTA TTCCTTTTTT GGGGAAATCT 
ATGATTGGAT AAGGAAAAAA CCCCTTTAGA 

2461 TGAAAAAGAC TTGGCAGCTT CCTGATAAAG 
ACTTTTTCTG AACCGTCGAA GGACTATTTC 

2521 TATTTATTGT GATGTTGTGG TTTTATTATC 
ATAAATAACA CTACAACACC AAAATAATAG 



GTGCAACAGC TCTTTTGAGA GGAGGCCTAA 
CACGTTGTCG AGAAAACTCT CCTCCGGATT 

AAGAAAATTA AATGTTGTAT TAAATAGATC 
TTCTTTTAAT TTACAACATA ATTTATCTAG 

ATTCCACTAG CTGGGTTCTG TATTTCAGTT 
TAAGGTGATC GACCCAAGAC ATAAAGTCAA 

CAGGAAAAAA ACTGTGCAAG TGAGCACCTG 
GTCCTTTTTT TGACACGTTC ACTCGTGGAC 

CCATGTCCTG GGCCTAAAAT CGTATAAAAT 
GGTACAGGAC CCGGATTTTA GCATATTTTA 

TCACATATGT AAACCAGAAC ATTCTATGTA 
AGTGTATACA TTTGGTCTTG TAAGATACAT 

GTTGCTATGA ATTAAACTTG TGTCATGCTG 
CAACGATACT TAATTTGAAC ACAGTACGAC 

ATTAAAATTT CTGCCATTTA GAAGAAGAGA 
TAATTTTAAA GACGGTAAAT CTTCTTCTCT 

TGAAAAGAAG AGTGGCCTTA TCTTCACTTT 
ACTTTTCTTC TCACCGGAAT AGAAGTGAAA 

GTACATTTTT ATATTCTCCT TTTGACATTA 
CATGTAAAAA TATAAGAGGA AAACTGTAAT 

ATATCTATTT TTACCAAAGG TATTTAATAT 
TATAGATAAA AATGGTTTCC ATAAATTATA 

TTTAGCTTGG TAAATTTTTC TAAACACAAT 
AAATCGAACC ATTTAAAAAG ATTTGTGTTA 

AAATATTGTT GCTCTGACAA AAATACATGT 
TTTATAACAA CGAGACTGTT TTTATGTACA 

TTAATCTGCA TTTTAAAAAA CTGAATTGGA 
AATTAGACGT AAAATTTTTT GACTTAACCT 

AAAATAATTA AATTATCATA TCTTCCATTC 
TTTTATTAAT TTAATAGTAT AGAAGGTAAG 

TTATGAAAGT AGACATTCAG ATCCAGCCAT 
AATACTTTCA TCTGTAAGTC TAGGTCGGTA 

GAGCCTAGCT CAGAAAAACA TAAAGCACCT 
CTCGGATCGA GTCTTTTTGT ATTTCGTGGA 

CGTGCTGTGC TGTGCAGTAG GAACACATCC 
GCACGACACG ACACGTCATC CTTGTGTAGG 

TTAAACTCTG TTCCATACAC TTGTATAAAT 
AATTTGAGAC AAGGTATGTG AACATATTTA 
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2581 ACATGGATAT TTTTATGTAC AGAAGTATGT CTCTTAACCA GTTCACTTAT TGTACTCTGG 

TGTACCTATA AAAATACATG TCTTCATACA GAGAATTGGT CAAGTGAATA ACATGAGACC 

2641 CAATTTAAAA GAAAATCAGT AAAATATTTT GCTTGTAAAA TGCTTAATAT CGTGCCTAGG 

GTTAAATTTT CTTTTAGTCA TTTTATAAAA CGAACATTTT ACGAATTATA GCACGGATCC 

2701 TTATGTGGTG ACTATTTGAA TCAAAAATGT ATTGAATCAT CAAATAAAAG AATGTGGCTA 

AATAC AC C AC TGATAAACTT AGTTTTTACA TAACTTAGTA GTTTATTTTC TTACACCGAT 

2761 TTTTGGGGAG AAAATT 
AAAACCCCTC TTTTAA 



Additional olJgonucleotides used for amplification of entire 



coding region 



5'-1 



TTTGTTTAAACCTTGG G AAACTGG 



5'-2 



GTCCAGGTTTTGCTTTGATCC 
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/^7& DNA Sequence Of Clones 4 & 7, Identical Clones Containing The 
Entire Open Reading Frame 

1 TTTGTTTAAA CCTTGGGAAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC TTTTCAAAAA 
AAACAAATTT GGAACCCTTT GACCAAGTCC AGGTCCAAAA CGAAACTAGG AAAAGTTTTT 

61 CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT GGAAACTACC 
GACCTCTGTG TCTTCTCCCG AGATCCTTTT TCAAAACCTA CCCTAATACA CCTTTGATGG 

121 CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG CCTTCCCCTG 
GACGCTAAGA GACGACGGTC TCGTCCGAGC CGCGAAGGTG GGGTCACGTC GGAAGGGGAC 

181 GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG AGTGAGCTCT 
CGCCACCACT TTCTCTGAGC CCTCAGCGAC GAAGGTTTCA CGGGCGGCAC TCACTCGAGA 

+2 • MetSer LeuPheGly LeuLeuLeu LeuThrSerAla LeuAlaGl 

] 

241 CACCCCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG CCCTGGCCGG 
GTGGGGTCAG TCGGTTTACT CGGAGAAGCC CGAAGAGGAC GACTGTAGAC GGGACCGGCC 

+2 yGlnArgGln GlyThrGlnAla GluSerAsn LeuSerSer LysPheGlnPhe SerSerAs 



3 01 CCAGAGACAG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT TTTCCAGCAA 
GGTCTCTGTC CCCTGAGTCC GCCTTAGGTT GGACTCATCA TTTAAGGTCA AAAGGTCGTT 

+2 nLysGluGln AsnGlyValGln AspProGln HisGluArg IlelleThrVal SerThrAs 



3 61 CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG TGTCTACTAA 
GTTCCTTGTC TTGCCTCATG TTCTAGGAGT CGTACTCTCT TAATAATGAC ACAGATGATT 

+2 nGlySerlle HisSerProArg PheProHis ThrTyrPro ArgAsnThrVal LeuValTr 



421 TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG TCTTGGTATG 
ACCTTCATAA GTGTCGGGTT CCAAAGGAGT ATGAATAGGT TCTTTATGCC AGAACCATAC 

+2 pArgLeuVal AlaValGluGlu AsnValTrp IleGlnLeu ThrPheAspGlu ArgPheGl 



481 GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG 
CTCTAATCAT CGTCATCTCC TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC 

+2 yLeuGluAsp ProGluAspAsp IleCysLys TyrAspPhe ValGluValGlu GluProSe 



541 GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG 
CGAACTTCTG GGTCTTCTAC TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC 

+2 rAspGlyThr IleLeuGlyArg TrpCysGly SerGlyThr ValProGlyLys GlnlleSe 



601 TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC 
ACTACCTTGA TATAATCCCG CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG 

+2 rLysGlyAsn GlnlleArglle ArgPheVal SerAspGlu TyrPheProSer GluProGl 



661 TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCAGG 
ATTTCCTTTA GTTTAATCCT ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGTCC 
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+2 yPheCysIle HisTyrAsnlle ValMetPro GlnPheThr GluAlaValSer ProSerVa 



721 GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA GTCCTTCAGT 
CAAGACGTAG GTGATGTTGT AACAGTACGG TGTTAAGTGT CTTCGACACT CAGGAAGTCA 

+2 lLeuProPro SerAlaLeuPro LeuAspLeu LeuAsnAsn AlalleThrAla PheSerTh 



7 81 GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG CCTTTAGTAC 
CGATGGGGGA AGTCGAAACG GTGACCTGGA CGAATTATTA CGATATTGAC GGAAATCATG 

+2 rLeuGluAsp LeuIleArgTyr LeuGluPro GluArgTrp GlnLeuAspLeu GluAspLe 



841 CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT TAGAAGATCT 
GAACCTTCTG GAATAAGCTA TAGAACTTGG TCTCTCTACC GTCAACCTGA ATCTTCTAGA 

+2 uTyrArgPro ThrTrpGlnLeu LeuGlyLys AlaPheVal PheGlyArgLys SerArgVa 



901 ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TTTGGAAGAA AATCCAGAGT 
TATATCCGGT TGAACCGTTG AAGAACCGTT CCGAAAACAA AAACCTTCTT TTAGGTCTCA 

+2 lValAspLeu AsnLeuLeuThr GluGluVal ArgLeuTyr SerCysThrPro ArgAsnPh 



961 GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC CTCGTAACTT 
CCACCTAGAC TTGGAAGATT GTCTCCTCCA TTCTAATATG TCGACGTGTG GAGCATTGAA 

+2 eSerValSer IleArgGluGlu LeuLysArg ThrAspThr IlePheTrpPro GlyCysLe 



1021 CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC CAGGTTGTCT 
GAGTCACAGG TATTCCCTTC TTGATTTCTC TTGGCTATGG TAAAAGACCG GTCCAACAGA 

+2 uLeuValLys ArgCysGlyGly AsnCysAla CysCysLeu HisAsnCysAsn GluCysGl 



1081 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA 
GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT TACTTACAGT 

+2 nCysValPro SerLysValThr LysLysTyr HisGluVal LeuGlnLeuArg ProLysTh 



1141 ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC 
TACACAGGGT TCGTTTCAAT GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG 

+2 rGlyValArg GlyLeuHisLys SerLeuThr AspValAla LeuGluHisHis GluGluCy 



1201 CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC ATGAGGAGTG 
GCCACAGTCC CCTAACGTGT TTAGTGAGTG GCTGCACCGG GACCTCGTGG TACTCCTCAC 

+ 2 sAspCysVal CysArgGlySer ThrGlyGly 



1261 TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC AGCTCTTGCC 
ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

1321 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT 
GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA GAGGTAGGAA 

13 81 AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG 
TTAGAGTCAA CAAACGAAGT TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC 

1441 AGGAGACATC AAACAGAATT AGGAGTTGTG CAA 
TCCTCTGTAG TTTGTCTTAA TCCTCAACAC GTT 
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. Predicted Full-length Polypeptide Sequence 

1 MSLFGLLLLT SALAGQRQGT QAESNLSSXF QFSSNKEQYG VQDPQHERII 

51 TVSTNGSIHS PRFPHTYPRN TVLVWRLVAV EENVWIQLTF DERFGLEDPE 

101 DDICKYDFVE VEEPSDGTIL GRWCGSGTVP GKQISKGNQI RIRFVSDEYF 

151 PSSPGFCIHY NIVMPQFTEA VSPSVLPPSA LPLDLLNNAI TAFSTLEDLI 

201 RYLEPERWQL DLEDLYRPTW QLLGKAFVFG RXSRWDLNL LTEEVRLYSC 

251 TPRNFSVSIR EELKRTDTIF WPGCLLVKRC GGNCACCLHN CNECQCVPSX 

301 VTKKYHEVLQ LRPXTGVRGL HXSLTDVALE HHSECDCVCR GSTGG 
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Alignment of VEGF-X with Other VEGFs 



VEG F — HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD — HUMAN 
990126vegx 



20 



40 



MSLFGLLLLT S ALAGQRQGTQAE SNLSSKFQF S SNKEQNGVQDPQHERI I 



50 



VEGF„HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC„HUMAN 
VEGD_HUMAN 
990126vegx 



VEGF_HUMAN 
PLGF_ HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



VEGF — HUMAN 
PLGF_HUMAN 
VEGB„HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



60 * 80 * 100 

TVSTNGSIHSPRFPHTYPRNWLVWRLVAVEENVWIQLTFDERFGLEDPE 
* 120 * 140 * 

MHLLGF F S VAC S LLAAALL PGPREAPAAAA 

MYREWWVNV 

DDICKYDFVEV--EEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDE 

160 * 180 * 200 


Mp 

AFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKM 
FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE 
YFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLED 



100 



30 
10 
148 



2 
2 

80 
60 
198 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



* 220 * 240 * 

fllswvhwslalllylhhakwsqaapmaegggqnhhevvkfmd-vyqgsy : 51 

vmrlfpcflqllaglalpavppqqwalsagngssevewpfqe-vwggsy : 51 

— mspllrrlllaallqi^paqapvsqpdapghqrkwswid-vytSat : 46 

ykcqlrkggwqhnreqanlnsrteetikfaaahynteilksidnewrHtq : 13 0 

dwklwrcrlrlksftsmdsrsashrstrfaatfydietlkvideewqgtq : 110 

liryleperwqldledlyrptwqllgkafvfgrksrvvdij^lteevsly : 248 



VEGF„HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



260 

CHPIETLVDIFQ 



280 



CRALERLVDWS a YPSEVEHMFS 
CQPREWVPLTV \ LMGTVAKQI/ 



CMPREVCIDVGK : 
CSPRETCVEVAS 



SCT PRNF SVS I RjgELKRTDTI F 1 



ypdeieyif: 



fgvatntff: 
Ilgkstntff: 




— egl: 
|gd — enl: 
pd—dgl: 

S--EGLQ 
E--ESLI 
|LHNCNECQ 




96 
96 
91 
175 
155 
298 
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VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGDJKUMAN 
99012 6 vegx 



teesnitmq[ 
vetanvtm* 

TGQHQVRMQS 
TSTSYLSKT 
TSTSYISKQ 



320 
KPHQG— 

jLKf RSGDR 

|LM \ RYPS 

FEjTVPLSQG--- 



QHIGEffiSFLQ 
PSYVEWTFSQ 
SQLGDTiSLE 
PKPVTQSR 



.JFE j SVPLTSV PELVPffiKV._ 

skvtkkyheBlqhrpktgwglhksltdBale: 




340 * 

IRPKKDRARQEK : 141 

RPLREKMKPER : 141 

IRPKKKDSAVKP : 13 5 

SKLDVYRQVH : 222 

LPTAPRHPYSI : 202 

'CRGSTGG : 345 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGDJiUMAN 
990126vegx 



360 * 380 
KSVRGKGKGQKRKRKKSRYKSWSVP 



400 



166 




VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6 vegx 



* 420 * 440 * 

TDGFHDICGPNKELDEETCQCVCRAGLRPASCGPHKELDRNSCQCVCKNK 
EDHSHLQEPALCGP 



322 
260 



VEGF_HUMAN 
PLGF„HUMAN 
VEGBJKUMAN 
VEGC_HUMAN 
VEGDJIUMAN 
990126vegx 



460 * 480 * 500 

CGPCSERRKHLFVQDPQTCKC - SCKNTDSRCKARQLELNER : 206 

CGDAVPRR : 149 

PLCPRCTQHHQRPDPRTCRCRCRRRSFLRCQGRGLELNPD : 17 9 

LFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCACECTESPQKCLLK : 372 

HMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETCCQKHKLFHPD : 310 



* 520 * 540 * 

VEGF_HUMAN : TCRCDKPRR : 215 

PLGF_HUMAN : : 

VEGB_HUMAN : TCRCRKLRR : 188 

VEGC_HUMAN : GKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS : 419 

VEGDJIUMAN : TCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP : 354 

990126vegx : : 
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Variant Polypeptide Sequences 



FL_seq 

clone41 

clone20 



20 



40 



MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
" -SLFGLLLLTSALAGPROGTOAESNLSSKFOFSSNKEOWGyODPOHERII 



50 
50 
50 



FL_seq 
clone41 
clone2 0 



60 



80 



100 



TVSTNGSIHSPRFPHTYPRNTVLVWRLVAVEENVl^IQLTFDERFGLEDPEl 
TVSTWG SIHS PRFPHTYPRNTVLVl'JRLVAVEENVl^I QLTFDERFGLEDPE 
TVSTNG S I H S PRF PHTYPRNTVLVVJRLVAVEENVWI QLTFDERFGLEDPE 



100 
100 
100 



FL_seq 

clone41 

clone20 



120 



140 



DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKOI5KGNOIRIRFVSDEYF 



150 
150 
150 



FL_seq 

clone41 

clone20 



PSEP 
PSEP 
PSEP 



160 * 180 * 200 

GiFCIHYNHvEPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLEDLI : 200 

SNRGGKinQgHTS : 167 

GFCIHYWE^PQFTEAVSPSVLPPSALPLDLLNNAITAFSTLEDLI : 200 



FL_seq 

clone41 

clone20 



* 220 * 240 * 

RYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDLNLLTEEVRLYSC : 250 

RYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLITLLTE : 243 



FL — seq 

clone41 

clone20 



260 * 280 * 300 

TPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSK 



300 



FL_seq 

clone41 

clone20 



* 320 * 340 

VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 

EVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



345 
282 
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Primers for Expression of VEGF-X 

E.coli expression of domain- 

vegx-6 AATTGGATCCGAGAGTGGTGGATCTGAACC 

vegx-7 AATTGGATCCGGGAAGAAAATCCAGAGTGG 

vegx-8 GGTTG AATTCATTA I Mil I AGTAACTTTGCTTGGGACAC 

vegX-9 AATTGAATTCATTATCCTCCTGTGCTCCCTC 

. Baculovirus/insect cell expression of full-length protein- 
vegbad 

AATTGGATCCGGAGTCTCACCATCACCACCATCATGAATCCAACCTGAGTAGTAAATTC 
C 

vegb ac2 AATTG AATTCGCTATCCTCCTGTGCTCCCTCTGC 
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>3993180H1 LUNGNON03 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAA 

AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 

TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCA^ 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAAT^ 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>2559870H1 ADRETUT01 . INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTC 

TC TTC AGGATTTAC AGTGC ATTC TGAAAG AGGAGA 

>3979767H1" "LUNGTOT08 - " INCYTE ~" 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 
GTTATCTCCATCCTTAATCTGAGTTGTTTGCTTCAAGGACCTTT 

ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGG 
GAAAG AANATTAAATGTTGT ATTAAAT AGACAC C AGCT 
>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTC 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTG^ 

AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

>4825396H1 BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTC 

GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 
AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 
AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNT01 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTC 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAAT^ 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>1302516H1 PLACNOT02 INCYTE 

AGG AAATC AAATTAGG AT AAGATTTGTATCTGATGAAT ATTTTC CTTC TGAAC CTTC T AACAGAGGAG GTAAG ATTAT AC 

AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTC 

CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGT? 

ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTG^ 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAGATCACCAGC 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATA 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>4713188H1 BRAIHCT01 INCYTE 

CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAA 

ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 

CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATC 

TTGCT 

>458823H1 KERANOT01 INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTT 

GTTTGNTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG^ 

C AAC AGC TC TTTTG AGAGGAGGC C T AAAGGNC AGGAGAAAAGGTCTTC AATC GTG GAAAGAAAATT AAATGTTGT ATT AA 

ATAGATC 

>1303909H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGG 

AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTC? 

CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
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>2739211H1 OVARNOT09 * INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 
TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 
GTGAGCACCTGAT 

>3325591H1 PTHYNOT0 3 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGA 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>3733565H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAATT 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>3554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT 

ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAG 

CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCACATATG 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDT0 1 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 
GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>4163378H1 BRSTNOT3 2 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTC 

ATGTCCTGGGCCTAAAATCGTATA 
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>2054675H1 BEPINOT01 INCYTE 

AAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAA^ 
TCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAA 

TATCGATAAGTC AGTTTATTTGTTTC ATTGTGTAC ATTTTTATATTC TC CTTTTGAC ATTATAA'CTGTTGGCTTTTC TAA 

TCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTA 

>3993180H1 LUNGNONO 3 INCYTE 

C AC AAATC ACTC ACC GACGTGGC C CTGGAGC ACC ATGAGGNGTGTG ACTGTGTGTGCAGAGGGAGC AC AGGAGGATAGC C 
GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 
CCTTAATC TCAGTTG TTTGCTTC AAGGAC CTTTC ATCTTC AGGATTTAC AGTGCATTCTGAAAGAGGAGAC ATC AAAC AG 
AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 
>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 

TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTG 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAAT^ 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>4164633H1 BRSTNOT3 2 INCYTE 

CTTGTTAAATATATC TATTTTTAC C AAAGGTATTTAATATTCTTTANTTATGAC AACTTAGATC AACTATTTTTAGCTTG 
GTAAATTTTTCT AAAC AC AATTGTT AT AGC C AG AGGAAC AAAGATG ATATAAAATATTGTTGC TCTG AC AAAAATAC ATG 
TATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAA 

AAGACTTTTTGANAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGGGGAGAAAAT 
>255987 0H1 ADRETUT01 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 
TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 
AGTGC AGTGGCTGATTC TATTAGAGAACGTATGCGTT ATC TC CATCCTTAATCTCAGTTGTTTGCTTC AAGGAC CTTTOV 
TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 
>3817470H1 BONSTUT0 1 INCYTE 

TTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTT 
AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTC 

ACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGA 
TAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTT 
>3979767H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGC ATC AC C ACCAGC AG CTC TTGCCCAGAGCTGTGCAGTGCAGTGG CTGATTC TATTAGAG AACGTATGC 
GTTATCTCC ATCCTTAATC TCAGTTGTTTGCTTC AAGGAC C TTTC ATCTTC AGGATTTAC AGTGCATTC TGAAAGAGGAG 
ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTC^ 
GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 
>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 
GTTATCTCCATCCTTAATC TCAGTTG TTTGCTTC AAGGAC CTTTC ATCTTC AGGATTTAC ATGCATTCTGA^ 
CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGG 
AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 
>4825396H1 BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATT 
GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 
AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 
AGGATAGCC GC ATC ACC AC C A 

>3073703H1 BONEUNT01 INCYTE 

AGAAAATCC AGAGTGGTGGATCTGAAC C TTCTAAC AGAGGAGGTAAGATTATAC AGC TGCAC ACCTCGTAACTTCTC AGT 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGA 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>862169H1 BRAITUT03 INCYTE 

AGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCA 
TTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATT 

CTGTTATTGGAGATGAAAATAAAAAGC AACTTATGAAAGTAGAC ATTC AGATCCAGC C ATT AC T AACCTATTC CTTTTTT 
GGGGAAATC TGAGCCT AGC 

>4201385H1 BRAITUT29 INCYTE 

TTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCT 

TAAAATTTCTGCCATTTAG AAGAAGAGAAC TAC ATTC ATGGTTTGGAAGAGAT AAAC C TGAAAAGAAGAGTGGCCTATC T 
TCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTGACATATAACTGTTGGCTTTT 
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CTAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATAT 
>1302516K1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTT 
ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGAAATCAAACANAATTAGGAGTTGTGCAACAGCTCTTTTGA 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAGATCACCAGCTA 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>2549720H1 LUNGTUT06 . INCYTE 

TTAGCTTGGNAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAA 
AATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGT 
AAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACT 
TATGANAGTAG 

>877279H1 LUNG AST 01 INCYTE 

CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAA 
GATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCAT 
TTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGGCTTTTTGAAAATAATTAAATTATCATATCTTCCATTCC 
TGTTATTGGNGG 

>4713188H1 BRAIHCT01 INCYTE 

C AAAGTTACTAAAAAATACC ACGAGGTC CTTC AGTTGAGAC C AAAGAC C GGTGTC AGGGGATTGC ACAAATC ACTC ACCG 
ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 
CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGT 
TTGCT 

>2171082H1 ENDCNOT03 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTA 

TATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATT 

CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTC 

GATGA 

>8758 60H1 LUNG AST 01 INCYTE 

CTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACC 

TGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCT 

TTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATT^ 

GAC 

>706168H1 SYNORAT04 INCYTE 

GCTC AT ATTCAC ATATGTAAACCAGAAC ATTCTATGTAC TAC AAAC CTGGTTTTTAAAAAGGANC TATGTTGC TATGAAT 
TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAAC 
TACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCANTTTATCGATAAGTCAGTTTATTTGT 
TTCA 

>458823H1 KERANOT01 INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTT 
GTTTGNTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTG 
CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAA 
ATAGATC 

>53843 6H1 LNODNOT02 INCYTE 

AAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 
CATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAAT 
TCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTAT 
>1303909H1 PLACNOT 0 2 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
>2739211H1 OVARNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 

GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 

TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAG 

GTGAGCACCTGAT 
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>2 5 5 03 4 3 HI LUNGTUT06 INCYTE 

TGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGG^ 

GTATrTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGC 
CAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTA 
>5321148H1 FIBPFEN06 INCYTE 

C AC AATTGTTATAGCC AGAGGAAC AAAGATG ATATAAAATATTGTTGCTC TGNC AAAAAT AC ATGTATTTCATTCTC GTA 
TGGTGCTAG AGTTAGATTAATC TGC ATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGC AAAGAC TTTTTGAAAA 
TAATTAAATTATCATATCTTCCATTCCTGTTATT 
CATTACTAAC 

>879495H1 THYRNOT 0 2 INCYTE 

ATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGC^ 
AGACTTTTTGAAAATAATTAAATTATCATATCTTCCArrcCTGTTATTGGAGATGAAAATAAA^ 
AGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCAC 
TGAAAAA 

>3325591H1 PTHYNOT03 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGA^ 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>543890H1 OVARNOT0 2 INCYTE 

TTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 
TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGNATAGAATTGGTAAGTTGCAAAGNCT^ 
TTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGGATGGAAAATAAAAAGCAACTTATGGAAAGTA^ 
ACATTCAGATC 

>3733565H1 SMCCNOS0 1 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAATT 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>4 641 93 9H1 PRO STMT 03 INCYTE 

GTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCCATGCTGATAGGACAGACTGGAT 
TTTNCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGNAGAGATAAACCTGAAAA 
GAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATGTGTACATTTTTATATTCTCCTTTGACAT 
ATAACGTGGCTTT 

>2007780H1 TESTNOT03 INCYTE 

TTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATT^ 
ATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAAC 
AAAGATGATATAAAATATTGTTGCTCTGANAAAAATACATGTAT 
>3085331H1 HEAONOT03 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAA^ 

AAACTTGTGTCGTGCTGATAGGAC AGAC TGGNTTTTTC ATATTTC TTATTANAATTTCTGCC ATTAGAAGAAGAGAAC T A 
CArrCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATTTCACTTTATCGATAAGTCAGT 
>3414043H1 PTHYNOT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAAT 
TAAACTTGTGTC GTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTC TGCC ATTTAGAAG AAGAGAAC 
TACATTCATGGTTTGGAAGAGATAAACCTGAAA 
>3705963H1 PENCNOT0 7 INCYTE 

ANACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAA 
TCTGGAnnnnnrmnnnrmnnrmrmGCTCATATTC 

AAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 

TTCTGCCATTAGAAGAAGAGAACTACNTTCANGGTTTGGAAGAGATAACCCTGAAAAGANGGG 

>5137051H1 OVARDIT04 INCYTE 

AAAAAAC TGAATTGGAATAGAATTGGTAAGTTGC AAAGAC TNTTTGAAAATAATTAAATTATC ATATC TTCC ATTCCTGT 

TATTGGAG ATGAANATAAAAAGC AACTTATGAAAGTAGAC ATTC AGATCC AGC C ATTAC TAACCTATTCCTTTTTTGGGG 

AAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTG1OTGTCA 

GTAGGAACACATCCTATTTATTGTGATGNTGTGGTTTATTAT 

>3554223H1 S YNONOT0 1 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCA 

ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAG 
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CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCACATATGTAAACCAGN 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDT01 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 
GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>1955646H1 CONNNOT01 INCYTE 

TGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGC 

AACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAA 

ACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGGAACACATCCTATTTA 

TTGTGATGTTGTGGTTTATATCCTAAACC 

>4163378H1 BRSTNOT3 2 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 

ATGTCCTGGGCCTAAAATCGTATA 

>5095141H1 EPIMNON05 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATNTTCACTTTATCGATAAGTCAGNTTATTTGTTTCATTGTGTACATTTNNA 
TATTCTCCTTTTGACATTATAACTGNTGGCTTTTCTAANCNTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATT 
CTTT 

>943826H1 ADRENOT03 INCYTE 

TATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAA 

AATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATG 

>3451273H1 UTRSNON03 INCYTE 

TTTTTTNTTTTGCTCATATTCACATATGTAAACCNGAACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAACTATG 
TTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCANATTTCTTANTAANNTTTCTGCCATTTAG 
AAGA 

>1402278H1 LATRTUT02 INCYTE 

GTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAA 
ATCGTAT AAAATCTGGAnnrmnnrmnnrmrmnnnnGC TC ATATTC ACAT ATGTAAAC C AGAAC ATTCTATGTAC TAC AAA 
CCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTT 
CTTA 

>4361191H1 SKIRNOT01 INCYTE 

GCAAAGACTTTTTGANAATNATTAANTTATCATATCTTCCATTCCTGTTATNGGAGATGANAATAAAAAGCAACTTATGA 
AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCNCAGAAAAACATAAAGC 
ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCNATTTATTGTGNTGTN 
GNGGTTTTATGATC 

>1307017H1 PLACNOT02 INCYTE 

TGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGC 
CTAAAATCGTATAAAATCTGGAnnrmnnnnrmnnnnnnnnGCTCATATTCACA 

ACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCA 
TAT 

>5032225H1 HEARFET03 INCYTE 

AATTATC ATATC TTCC ATTC CTGTTATTGGAGATGNAAAT AAAAAGC AAC TTATGAAAGTAGAC ATTC AGATCC AGCC AT 
TACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTGTCAGCTTC 
CTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCGTT 
CCAT 

>3732621H1 SMCCNOS01 INCYTE 

ANAGATGATATAAAANATTGTTGCTCTGACAANNATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 

CNTTTTAAAAAACTGANTTGGAATAGANTTGGTAAGT 

>3530274H1 BLADNOT09 INCYTE 

TTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATT 
CCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCG 
TGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTAAACTCTGTTCCATACACTTG 
TATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCA 1 
>3530249H1 BLADNOT09 INCYTE 

CTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGANAGTAGACATTCAGATCCAGCCATTACTAACCTAT 
TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGC 
GTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACT 
TGTAT AAATACATGGAT ATTTTTATGTAC AGAAGTATGTC TC TTAACC AGTTC ACTTATTGTACCTGG 
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VEGFE1 


AAAATGTATGGATACAACTTAC 


22 


VEGFE2 


Giri'GATGAAAGATTTGGGCTTG 


23 


VEGFE3 


TTTCTAAAGGAAATCAAATTAG 


22 


VEGFE4 


GATAAGATTTGTATCTGATG 


20 


VEGFE5 


GATGTCTCCTCTTTCAG 


17 


VEGFE6 


GCACAACTCCTAATTCTG 


18 


VEGFE7 


AGCACCTGATTCCGTTGC 


19 


VEGFE8 


TAGTACATAGAATGTTCTGG 


20 


VEGFE9 


AAGAGACATACTTCTGTAC 


19 


VEGFE10 


CCAGGTACAATAAGTGAACTG 


21 
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+ 3 M N I F L L 

NLLT EEV RLY 

] 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TCCTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3SCTP RNF S V ■ S IREE LKR 
TDT IFWP G CL 

3 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 

-2 < 



+ 3 LVK RCGG NCA CCL HNCN 
ECQ CVP SKV 



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 

GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 



+3TKKY HEV LQLR PKT GVR 
GLHK SLT D V A 



+ 1 V S G 

DCT NHSP T W P 

] 



241 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG 
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 

-2 

t 



+ 3LEHH EEC DCV CRGS TGG 
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> 

+2 V Q R E H R R 

IAAS PPA ALA 

] 



+ 1 WST MRSV TVC AEG AQ ED 
SRI TTS SSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+2QS.CA VQW L I L LENV CVI 
SIL NLSC LLQ 



+ 1PELC SAV ADSI RER MRY 

LHP 
> 

401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC AT AC GC AATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+2 GPF IFRI YSA F 
> 

481 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA 

561 TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 

641 AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 

CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTCGAGGT ACAGGACCCG . - 

801 CTAAAATCGT ATAAAATCTG GA 
GATTTTAGCA TATTTTAGAC CT 
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+ 3 M N I F L L 

NLLT EEV R - L Y 

] 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TCCTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3SCTP RNF SVS IREE LKR 
TDTIFWPGCL 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 
-2 



+ 3 LVK RCGG NCA CCL HNCN 
ECQ CVP SKV 



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 

GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 



+ 3TKKY HEV LQLR PKT GVR 
GLHK SLT D V A 



^ ± V S G 

DCT NHSP TWP 

] 



241 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG 
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG ■ GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 
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-2 

-[ 

+ 3LEHH EEC DCV CRGS TGG 



> 

+ 2 VQREHRR 
IAAS PPA ALA 

] 



+ 1 WST MRSV TV C AEG AQED 
SRI T T 3 SSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+2QSCA VQW LIL LENV CVI 
SIL NLSC LLQ 



+ 1PELC SAV ADSI RER MRY 

LHP 



401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+ 2 GPF IFRI YSA F 



481 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA 

561 TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 

641 AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 
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CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTCGAGGT ACAGGACCCG 

801 CTAAAATCGT ATAAAATCTG GATTTTTTTN TTTTTTTTTG CGCATATTCA 
CATATGTAAA CCAGAACATT CTATGTACTA 

GATTTTAGCA TATTTTAGAC CTAAAAAAAN AAAAAAAAAC GCGTATAAGT 
GTATACATTT GGTCTTGTAA GATACATGAT 

881 CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 
CGTGCTGATA GGACAGACTG GATTTTTCAT 

GTTTGGACCA AAAATTTTTC CTTGATACAA CGATACTTAA TTTGAACACA 
GCACGACTAT CCTGTCTGAC CTAAAAAGTA 

-3 < 



9 61 ATTTCTTATT AAAATTTCTG CCATTTAGAA GAAGAGAACT ACATTCATGG 
TTTGGAAGAG ATAAACCTGA AAAGAAGAGT 

TAAAGAATAA TTTTAAAGAC GGTAAATCTT CTTCTCTTGA TGTAAGTACC 
AAACCTTCTC TATTTGGACT TTTCTTCTCA 

-3 



1041 GGCCTTATCT TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA 
CATTTTTATA TTCTCCTTTT GACATTATAA 

CCGGAATAGA AGTGAAATAG CTATTCAGTC AAATAAACAA AGTAACACAT 
GTAAAAATAT AAGAGGAAAA CTGTAATATT 
"3 [ 

1121 CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT 
TTAATATTCT TTTTTATGAC AACTTAGATC 

GACAACCGAA AAGATTAGAA CAATTTATAT AGATAAAAAT GGTTTCCATA" 
AATTATAAGA AAAAATACTG TTGAATCTAG 

12 01 AACTATTTTT AGCTTGGTAA ATTTTTCTAA ACACAATTGT TATAGCCAGA 
GGAACAAAGA TGATATAAAA TATTGTTGCT 

TTGATAAAAA TCGAACCATT TAAAAAGATT TGTGTTAACA ATATCGGTCT 
CCTTGTTTCT ACTATATTTT ATAACAACGA 

1281 CTGACAAAAA TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA 
ATCTGCATTT TAAAAAACTG AATTGGAATA 

GACTGTTTTT ATGTACATAA AGTAAGAGCA TACCACGATC TCAATCTAAT 
TAGACGTAAA ATTTTTTGAC TTAACCTTAT 

13 61 GAATTGGTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT 
TCCATTCCTG TTATTGGAGA TGAAAATAAA 

CTTAACCATT CAACGTTTCT GAAAAACTTT TATTAATTTA ATAGTATAGA 
AGGTAAGGAC AATAACCTCT ACTTTTATTT 

1441 AAGCAACTTA TGAAAGTAGA CATTCAGATC CAGCCATTAC TAACCTATTC 
CTTTTTTGGG GAAATCTGAG CCTAGCTCAG 

TTCGTTGAAT ACTTTCATCT GTAAGTCTAG GTCGGTAATG ATTGGATAAG 
GAAAAAACCC CTTTAGACTC GGATCGAGTC 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 PCT/US99/30503 

32 I 54 



1521 AAAAACATAA AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCGT 
GCTGTGCTGT GCAGTAGGAA CACATCCTAT 

TTTTTGTATT TCGTGGAACT TTTTCTGAAC CGTCGAAGGA CTATTTCGCA 
CGACACGACA CGTCATCCTT GTGTAGGATA 

1601 TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG 
TATAAATACA TGGATATTTT TATGTACAGA 

AATAACACTA CAACACCAAA ATAATAGAAT TTGAGACAAG GTATGTGAAC 
ATATTTATGT ACCTATAAAA ATACATGTCT 

1681 AGTATGTCTC TTAACCAGTT CACTTATTGT ACCTGG 
TCATACAGAG AATTGGTCAA GTGAATAACA TGGACC 
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r - DNA and polypeptide sequence used for mammalian cell expression 

+ 1 mslfglllltsalagqr 
1 GGATCCAAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC CGGCCAGAGA 

+ lqgtq aES NLS SKFQ ?SS NKE 
61 CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG CAACAAGGAA 

+ 1QNGV QDP QHE RIIT VST NGS 
121 CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTG7GTCTAC TAATGGAAGT 

+ 1IHSP R F P HTY P R N T VLV WRL 
181 ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAA7A CGGTCTTGGT ATGGAGATTA 

+ 1VAVE E N V WIQ L T F D ERF GLE 
241 GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT TGGGCTTGAA 

+ i "d" p e d d ic kyd FVEV EEP SDG 
301 GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC CAGTGATGGA 

+ 1TILG RWC GSG TVPG KQI SKG 
361 ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT TTCTAAAGGA 

+ 1NQIR IRF VSD E Y F P SEP GFC 
421 AATCAAATTA GGA7AAGATT TGTATCTGAT GAATATT7TC CTTC7GAACC AGGGTTCTGC 

+ 1 I H Y N IVM PQF TEAV SPS VLP 
481 ATCCACTACA ACA7TGTCAT GCCACAATTC ACAGAAGCTG TGAG7CCTTC AGTGCTACCC 

+ 1PSAL ?LD LLN NAIT AFS TLE 
541 CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTA7AA CTGCCTTTAG TACCTTGGAA 

+ 1DLIR YLE PER WQLD LED LYR 
601 GACCTTATTC GA7A7CTTGA ACCAGAGAGA TGGCAGT7GG AC77AGAAGA TCTATATAGG 

+ 1PTWQ LLG KAF VFGR K S R VVD 
661 CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTT7TGGAA GAAAATCCAG AG7GGTGGAT 

+ 1LNLL 7EE VRL YSCT PRN F SV 
721 CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACC7CGTAA CTTCTCAGTG 

+1SIRE ELK RTD TIFW PGC L L V 
781 TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG TCTCCTGGTT 

+ 1KRCG GMC ACC L H N C NEC QCV 
841 AAACGC7GTG GTGGGAACTG TGCCTGTTGT CTCCACAA77 GCAA7GAATG TCAATGTGTC 

+ 1PSKV TKK YHE VLQL ?. PK TGV 
901 CCAAGCAAAG T7AC7AAAAA ATACCACGAG GTCC77CAG7 TGAGACCAAA GACCGGTGTC 

+1RGLH KSL TDV ALEH HEE CDC 
961 AGGGGAT7GC ACAAA7CACT CACCGACGTG GCCC7GGAGC ACCA7GAGGA GTGTGACTG7 

+ 1VCRG STG G SR G?FE G K ? IPN 
1021 GTG7GCAGAG GGAGCACAGG AGGATCTAGA GGGCCC77CG AAGG7AAGCC TATCCC7AAC 

+ 1 P L L G LPS T '" R T G H H H K H H 
1081 CCTCTCCTCG G7CTCGATTC TACGCGTACC GGTCA7CA7C ACCA7CACCA TTGA 
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DNA and polypeptide sequence used for baculovirus/insect cell expression 

1 GAATTCAAAG GCC7GTATTT TACTGTTTTC GTAACAGTTT TGTAATAAAA AAACCTATAA 

+3 mkf Ivn.valv fmv vyi syi 
61 ATATGAAATT CTTAGTCAAC GTTGCCCTTG TTTTTATGGT CGTATACATT TCTTACATCT 

+ 3 y a DP gTSH KHHK H ES NLS SKF 
121 ATGCGGATCC GGAGTCTCAC CATCACCACC ATCATGAATC CAACCTGAGT AGTAAATTCC 

+ 3QF SS N K E QNGV Q D P QHE RII 
181 AGTTTTCCAG CAACAAGGAA CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA 

+ 3TVST NGS IHSP RFP HTY PRN 
241 CTGTGTCTAC TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA 

+3TVLV WRL VAVE ENV WIQ L T F 

3 01 CGGTCTTGGT ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG 

+ 3DERF GLE DPED DIC KYD FVE 
361 ATGAAAGATT TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG 

+3 VEEP SDG TILG RWC GSG TVP 
421 TTGAGGAACC CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG 

+3GKQI SKG NQIR IRF VSD EYF 

4 81 GAAAACAGAT TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC 

+ 3PSEP GFC IKYN IVM PQF TEA 
541 CTTCTGAACC .-.GGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG 

+ 3VSPS VLP FSAL PLD L L N N A I 
601 TGAGTCCTTC AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA 

+ 3TAFS TLE DLIR YLE PER WQL 
661 CTGCCTTTAG TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG 

+ 3 D It E D LYR PTWQ LLG KAF VFG 
721 ACTTAGAAGA TCTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA 

+ 3RKSR VVD LNLL TEE VRL YSC 
781 GAAAATCCAG AGTC-GTGGAT C7GAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA 

+ 3TPRN FSV SIRE ELK RTD TIF 
841 CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT 

+ 3WPGC L L V KRCG GNC ACC LKN 
901 GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT 

+3CNEC OCV PSKV TKK .YHE VLQ 
961 GCAATGAATG TCAA7GTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT 

+ 3LRPK TGV RGLH.KSL TDV ALE 
1021 TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC 

+ 3HHEE SDC VCRG STG G 
1081 ACCATGAGGA GTGTGACTGT G7G7GCAGAG GGAGCACAGG AGGATAGCTC TAGA 
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+3 OTN SSS NNNN NNN MNN LGI 
1 CGCAGACTAA TTCGAGCTCG AACAACAACA ACAATAACAA TAACAACAAC CTCGGGATCG 

+ 3 E G R I S E F ESNL SSK FQF SSN 
61 AGGGAAGGAT TTCAGAATTC GAATCCAACC TGAGTAGTAA ATTCCAGTTT TCCAGCAACA 

+ 3KEQN GVQ DPQH ERI ITV STN 
121 AGGAACAGAA CGGAGTACAA GATCCTCAGC ATGAGAGAAT TATTACTGTG TCTACTAATG 

+ 3GSIH SPR FPHT YPR NTV LVW 
181 GAAGTA7TCA CAGCCCAAGG TTTCCTCATA CTTATCCAAG AAATACGGTC TTGGTATGGA 

+ 3RLVA VEE NVWI QLT FDE RFG 
241 GATTAGTAGC AGTAGAGGAA AATGTATGGA TACAACTTAC GTTTGATGAA AGATTTGGGC 

+ 3 "L "E " D P E DD ICKY DFV EVE EPS 
301 TTGAAGACCC AGAAGATGAC ATATGCAAGT ATGATTTTGT AGAAGTTGAG GAACCCAGTG 

+ 3DGTI LGR WCGS GTV PGK QIS 
361 ATGGAACTAT ATTAGGGCGC TGGTGTGGTT CTGGTACTGT ACCAGGAAAA CAGATTTCTA 

+ 3KGKQ IRI RFVS D E Y FPS EPG 
421 AAGGAAATCA AATTAGGATA AGATTTGTAT CTGATGAATA TTTTCCTTCT GAACCAGGGT 

+ 3FCIH YNI VMPQ FTE AVS PSV 
481 TCTGCATCCA CTACAACATT GTCATGCCAC AATTCACAGA AGCTGTGAGT CCTTCAGTGC 

+ 3LPPS ALP LDLL NNA I T A FST 
541 TACCCCCTTC AGCTTTGCCA CTGGACCTGC TTAATAATGC TATAACTGCC TTTAGTACCT 

+ 3LEDL IRY LEPE RWQ L D L EDL 
601 TGGAAGACCT TATTCGATAT CTTGAACCAG AGAGATGGCA GTTGGACTTA GAAGATCTAT 

+ 3YRPT W Q L LGKA F V F GRK S R V ' 
661 ATAGGCCAAC TTGGCAACTT CTTGGCAAGG CTTTTGTTTT TGGAAGAAAA TCCAGAGTGG 

+ 3VDLN LLT EEVR LYS CTP RNF 
721 TGGATC7GAA CCTTCTAACA GAGGAGGTAA GATTATACAG CTGCACACCT CGTAACTTCT 

+ 3SVSI REE LKRT DTI FWP G C I# 
781 CAGTGTCCAT AAGGGAAGAA CTAAAGAGAA CCGATACCAT TTTCTGGCCA GGTTGTCTCC 

+ 3LVKR CGG NCAC CLK NCN ECQ 
841 TGGTTAAACG CTGTGGTGGG AACTGTGCCT GTTGTCTCCA CAATTGCAAT GAATGTCAAT 

+ 3CVPS KVT KKYH EVL QLR PKT 
901 GTGTCCCAAG CAAAGTTACT AAAAAATACC ACGAGGTCCT TCAGTTGAGA CCAAAGACCG 

+ 3 G V ?. G LHK SLTD V A L EHH EEC 
961 GTGTCAGGGG ATTGCACAAA TCACTCACCG ACGTGGCCCT GGAGCACCAT GAGGAGTGTG 

+ 3 D C V C ?. G S T G G K H H K K H * 
1021 ACTGTG7GTG CAC-AGGGAGC ACAGGAGGAC ATCATCACCA TCACCATTGA TCTAGAGTCG 

1081 ACCTGCAGGC AAGC77 
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Disulphide-iinked dimerisation of VEGF-X 



(A) Mammalian cell expression 
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(B) E.coli expression 
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Glycosylation of VEGF-X 




Full-length product 



C-terminal proteolysis fragment 
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DNA and polypeptide sequence usedfor E.coli expression of the PDGF-like domain 

+ 3 MR GSHH HHH HGM ASM 

1 AAGGAGATAT ACATATGCGG GGTTCTCATC ATCATCATCA TCATGGTATG GCTAGCATGA 

+ 3 TGGO QMG RDLY DDD DKD P G R 
61 CTGGTGGACA GCAAATGGGT CGGGATCTGT ACGACGATGA CGATAAGGAT CCGGGAAGAA 

+3KSRV VDL NLLT EEV RLY SCT 
121 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGAGG7 AAGATTATAC AGCTGCACAC 

♦3 P R N F SVS IREE LKR TDT IFW 
181 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+3PGCL LVK RCGG NCA CCL KNC 
241 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 

+ 3NECQ CVP SKV T KKY HEV L Q L 
301 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA 

+ 3RPKT GVR GLKK SLT D V A LEH 
361 GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+3HEEC DCV CRGS TGG 
421 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAATGAATT CGAAGCTTGA • 

4 81 TCCGGCTGCT AACAAAGCCC 
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Expression of PDGF domain in Exoli 
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DNA and polypeptide sequence used for E.coli expression of the CUB-Iike domain 

+2 MA M DIG INS DPE SHHH HHH 

1 GGCGATGGCC A7GGATATCG GAATTAATTC GGATCCGGAG TCTCACCATC ACCACCATCA 

+ 2 ESN uSSK FQF SSN KEQN GVQ 
61 TGAATCCAAC CTGAGTAGTA AATTCCAGTT TTCCAGCAAC AAGGAACAGA ACGGAGTACA 

+ 2 DPQ HERI ITV STN GSIH S P R 
121 AGATCCTCAG CATGAGAGAA TTATTACTGT GTCTACTAAT GGAAGTATTC ACAGCCCAAG 

+ 2 F P H TYPR NTV L VW RLVA VEE 
181 GTTTCCTCAT ACTTATCCAA GAAATACGGT CTTGGTATGG AGATTAGTAG CAGTAGAGGA 

+2 NVW IQLT FDE RFG LEDP EDD 
241 AAATGTATGG ATACAACTTA CGTTTGATGA AAGATTTGGG CTTGAAGACC CAGAAGATGA 

+ 2 ICK YDFV EVE EPS DGTI LGR 
301 CATATGCAAG TATGATTTTG TAGAAGTTGA GGAACCCAGT GATGGAACTA TATTAGGGCG 

+2 WCG SGTV PGK QIS KGNQ IRI 
361 CTGGTGTGGT TCTGGTACTG TACCAGGAAA ACAGATTTCT AAAGGAAATC AAATTAGGAT 

+2 RFV SDE. Y FPS EPG FCIH YNI 
421 AAGATTTGTA TCTGATGAAT ATTTTCCTTC TGAACCAGGG TTCTGCATCC ACTACAACAT 

+2 VMP CFTE AV 
481 TGTCATGCCA CAATTCACAG AAGCTGTGTA GTCGAGCTCC GTCGACAAGC TTGCGGCCGC 

541 ACTCGAGCAC 
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Expression of the CUB domain in Exoli 

M CUB 
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Partial intron/exon structure of the VEGF-X gene 
(A) - Genomic DNA sequences of 2 exons determined by sequencing 



.tttcttttataccatatagtgg-cgatctgaaccagGGTTCTG^ 

AGTCCTTCAGTGCTACCCCCTTCAGCTTTGCCACTGGACCTGCTrAATAATGCTATAACTGCCTTTAGT^ 
TCGATATCTTGAACCAGAGAGATC<;CAGTTGGACTTAGAAGA 

TTGGAAGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTG 
TCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAAC^ 

TTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAJ^TTACTAAAAAATACCACGAGg t agg tatacaatttccttttt 
ggtttccttcgggtattttatgrcct 

aaagccagccacagacattcgttgatttttaaaagtggcttactcctacccccttccagGTCCTTCAGTTGAGACCAAAGACCGGT 
GTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCA 
ATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGT^ 
CCTTAATCTCAGTTGTTTGCTTCA&GGACCTTTCATCTT 

GAGTTGTGCAACAGCTCITTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCT^ 
AAATAGATCACGAGCTAGTTTCAGAGTTACCATGTACGTATTCC^ 

GGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAGCT 

CTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCGCATATTCACATATGT 

TGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGG^ 

AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGA 

TCGATAAGTCAGTTTATTTG7TTCATTGTGTACATTTTTATATTCTCCTTTTGACATTA 

AATATATCTATTTTTACCAAAC^TATTTAA^ 

ACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTC 
CTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAG 
TCATATCrTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAG 
TCCTTTTTTGGGGAAA7CTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAM 

TGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTT7ATTATCTTAAACTCTGTTCCATACACTTGTAT 

TGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCT 

TGCTTGTAAAATGCTTAATATCGTGCCT^ 

GGCTATTTTGGGGAGAAAATTatgcgtgtgtgcgctcaagatctattccttggaccctgagaaaatgaaagataaa 
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(B) - Location of splice sites within the cDNA sequence 



1 GAATTCCCCC TTTTGTTTAA ACCTTGGGAA CTGGTTCAGC TCCAGGTTTT GCTTTGATCC 

61 TTTTCAAAAA CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT 

121 GGAAACTACC CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG 

181 CCTTCCCCTG GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG 



+3 WSZ.fGLLLX.rS 
241 AGTGAGCTCT CACCCCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG 

+3 A L A G Q R Q G T Q A ESN L S S K F Q 
301 CCCTGGCCGG CCAGAGACAG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT 

+3FSSN K E Q Jf G V Q D . P Q HER I I T 
361 TTTCCAGCAA CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG 

+3 V S T N G S I K S P R F P H T Y P R N T 
422 TGTCTACTAA TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG 



+3 V L V W R L V A V E E N V W I Q L T F D 
4$1 TCTTGGTATG GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTT7GATG 

+3 E R F G LED P E D D I C K Y D F V E V 
541 AAAGATTTGG GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG 

+3EEPS DGT ILGR W C G S G T V P G 
€01 AGGAACCCAG TGATGGAACT AT ATT AGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA 

+3 K Q I S X G N Q I R I R F V S D E Y F P 
661 AACAGATTTC TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT 

+3 S £ P |g ? C I KVNI VMP .QFT E A V 
721 CTGAACCAGG GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA 

+3SPSV L?P SALP LDL LNN AIT 
781 GTCCTTCAGT GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG 

+3AFST LSD L I R Y LEP ERW QLD 
841 CCTTTAGTAC CTTGGAAGAC CT7ATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT 

+3LEDL Y R P TWQL LGK A F V FGR 
901 TAGAAGATCT ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TT7GGAAGAA 



+ 3KSRV VDL KLLT E 
961 AATCCAGAGT GGTGGA7CTG AACCTTCTAA CAGA< 




R L Y S C T 
T AAGATTATAC AGCTGCACAC 



+ 3PRNF SVS IREE.LKR TDT IFW 
1021 CTCGTAACTT C7CAGTSTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+ 3PGCL LVK RCGG NCA CCL HNC 
1081 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAAT7GCA 



♦ 3 N E C Q CVP S X V T K K Y KS 
1141 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA C7AAAAAATA 



V L 0 L 
CCACGAGGTC CTTCAGTTGA 
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+ 3RPKT GVR GLHK SLT D V A LEH 

1201 GACCAAAGAC CGG7GTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+3HEEC DCV CRGS TGG 

1261 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC 

1321 AGCTCTTGCC CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 

1381 CTCCATCCTT AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC 

1441 AT7C7GAAAG AGGAGACATC AAACAGAATT AGGAGT7G7G CAACAGCTCT TTTGAGAGGA 

1501 GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT GTTGTATTAA 

1561 ATAGATCACC AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 

1621 TTCAGTTCTT TCGATACGGC TTAGGGTAAT GTCAGTACAG GAAAAAAACT GTGCAAGTGA 

1681 GCACCTGATT CCGTTGCCTT GCTTAACTCT AAAGCTCCAT GTCCTGGGCC TAAAATCGTA 

1741 TAAAATCTGG ATTTTTTTTT TTTTTTTTTG CTCATATTCA CATATGTAAA CCAGAACATT 

1801 CTATGTACTA CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 

1861 CATGCTGATA GGACAGACTG GATTTTTCAT ATTTCTTATT AAAATTTCTG CCATTTAGAA 

1921 GAAGAGAACT ACATTCATGG TTTGGAAGAG ATAAACCTGA AAAGAAGAGT GGCCTTATCT 

1981 TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTG7A CATTTTTATA TTCTCCTTTT 

2041 GACATTATAA CTGtTGGCTT TTCTAATCTT GTTAAATA7A TCTATTTTTA CCAAAGGTAT 

2101 TTAATATTCT TTTTTATGAC AACTTAGATC AACTATTT7T AGCTTGGTAA A7TTTTCTAA 

2161 ACACAATTGT TATAGCCAGA GGAACAAAGA TGATATAAAA TATTGTTGCT CTGACAAAAA 

2221 TACATG7ATT 7CATTC7CGT ATGGTGCTAG AGTTAGATTA ATCTGCATTT 7AAAAAACTG 

2281 AA7TGGAATA GAAT7GG7AA GT7GCAAAGA C77TTTGAAA A7AAT7AAA7 7A7CA7ATCT 

2341 7CCA77CCTG 77AT7GGAGA TGAAAA7AAA AAGCAAC77A 7GAAAG7AGA CATTCAGATC 

2401 CAGCCA77AC TAACCTA7TC C7T7TT7GGG GAAATC7GAG CCTAGC7CAG AAAAACA7AA 

2461 AGCACCT7GA AAAAGACTTG GCAGCT7CC7 GA7AAAGCG7 GCTGTGC7G7 GCAG7AGGAA 

2521 CACA7CC7A7 7TAT7GTGAT G77GTGG777 TA7TA7C77A AACTC7G77C CA7ACAC77G 

2581 7A7AAA7ACA TCCATA7777 7A7G7ACAGA AG7ATG7C7C TTAACCAGTT CACT7A7TG7 

2641 ACCTGGAAGG GCGAA7TCTG CAGATA7C 
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Applicant's nraiienl's 

lilciclcrvncc ' B0192/7011WO 



Internal it innlapplicatiiHi No. 
PCT/US99/30503 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCTRule \3bis) 



A. The indications made below relate lo the deposited microorganism or other biological material referred to in the description 
on page j-* .line 15-16 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet | | 



Name of depositary institution 

BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS (BCCM) m 
LABORATORIUM VOOR MOLECULAIRE BIOLOGIE - PLASMIDENCOLLECTIE (LMBP) 



Address of depositary institution (including postal code and country} 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Dale or deposit 


Accession Number 


20 December 1999 (20.12.99) 


I21BP 3991 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States} 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later f.yvq/i- the general nature of the indtauiotvi eg. "Accessuw 
Number of DefXisit") 



For receiving Office use only 



n This sheet was received with the international application 



Authorized olVteer 



I H i n i i I ■ i )i PH ' m i l MJ 



This sheet was received bv the jHtematiojLnlUurcau on: 

19 APRIL 2000 < 13^*y ' 


Authorized officer £j\ e n MoyS© )\ 
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Budapest Treaty on the International Recognition of the Deposit of Microorganisms for 

the Purposes of Patent Procedure 

Receipt in the case of an original deposit issued pursuant to Rule 7.1 by the 
International Depositary Authority BCCM™/LMBP identified at the bottom of next page 

International Form BCCM™/LMBP/BP/4/99-23 



To : Name of the depositor : Janssen Phanmaceutica N.V. 



Address : Turnhoutseweg 30 

B-2340 Beerse 
Belgium 



I. Identification of the microorganism: 

1.1 Identification reference given by the depositor: 

VEGF-X CUB PET22b 



1. 2 Accession number given by the International Depositary Authority: 



LMBP 3991 
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LMBP-COLLECTION 

Page 2 of Form BCCM™/LMBP/BP/4/99-23 Receipt in the case of an original deposit 

II. Scientific description and/or proposed taxonomic designation 
The microorganism identified under I above was accompanied by: 

(mark with a cross the applicable box(es)) 

- a scientific description yes E><3 no Q 

- a proposed taxonomic designation yes □ no 13 

III. Receipt and acceptance 

This International Depositary Authority accepts the microorganism identified under I 
above, which was received by it on (date of original deposit) : December 20, 1999 



IV. International Depositary Authority 



Belgian Coordinated Collections of Microorganisms (BCCM™) 

Laboratorium voor Moleculaire Biologie - PiasmidencoIIectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Signature(s) of person(s) having the power to represent the International Depositary 
Authority or of authorized official(s): 




Date : January 1 2, 2000 



Martine Vanhoucke 
BCCM/LMBP curator 
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Budapest Treaty on the International Recognition of the Deposit of Microorganisms for 

the Purposes of Patent Procedure 

Viability statement issued pursuant to Rule 10.2 by the International Depositary 
Authority BCCM™/LMBP identified on the following page 

Internationa/ Form BCCM™/LMBP/BP/9/99-23 



To : Party to whom the viability statement is issued: 
Name : Dr Filip De Cone 

Address : Janssen Pharmaceutica N.V. 

Turnhoutseweg 30 
B-2340 Beerse 
Belgium 

L Depositor 

1.1 Name : Janssen Pharmaceutica N.V. 



I.2 Address : Turnhoutseweg 30 

B-2340 Beerse 
Belgium 



II. Identification of the microorganism: 

11.1 Accession number given by the International Depositary Authority: 



LMBP 3991 



II. 2 Date of the original deposit (or where a new deposit or a transfer has been 
made, the most recent relevant date) : December 20, 1999 

III. Viability statement. 

The viability of the microorganism identified under II above was tested on 

: January 1 1 , 2000 

(Give date. In the cases referred to in Rule 10.2(a)(ii) and (iii), refer to the most recent 
viability test). 

On that date, the said microorganism was: (mark the applicable box with a cross) 

[X] viable 

O no longer viable 
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IV. Conditions under which the viability test has been performed: 

' (Fill in if the information has been requested and if the results of the test were 
negative). 



V. International Depositary Authority 



Belgian Coordinated Collections of Microorganisms (BCCM 1M ) 

Laboratorium voor Moleculaire Biologie - Plasmidencollectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 

Signature(s) of person(s) having the power to represent the International Depositary 
Authority or of authorized official(s): 




Date : January 1 2, 2000 



Martine Vanhoucke 
BCCM/LMBP curator 



